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Background of the Invention 
Fldd of the ImeuOon 

The present invention relates to recombinant DNA technology. DNA and 
vectors having engineered recombination sites arc provided for use in a 
recombinational cloning method that enables efficient and specific recombination 
of DNA segments using recombination proteins. The DNAs, vectors and 
me&ods are usefiil for a variety of DNA exchanges, sudi as subcloning of DNA, 
in vitro or in vivo. 

ReiaUdArt 

SUe specffk recontbinases. Site speci&c lecombinases are en^mes ftat 
are present in some viruses and bacteria and have been characterized to have both 
endonuclease and ligase properties. These recombinases (along with associated 
proteins in some cases) recognize specific sequences of bases in DNA and 
exchange the DNA segments flaiddng those segments. The recombinases and 
associated proteins are collectively referred to as "recombination proteins" (see, 
e.g.„ Landy, A., Curretit Opinion in Biotechnology 5:699-707 (1993)). 

Numerous recombination systems from various organisms have been 
described. See. e.g.. Hoess etal.. Nucleic Acids Research 74(6):2287 (1986); 
Abremski et oL, J. Biol Chem.261{l):39l (1986); Campbell, J. 
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BacterioL /7<23):7495 (1992); QianerdL, J BioL Chem. ^tf7(ll):7794 (1992); 
Araldc/«£.^ Mol. Biol. 22J(1):25 (1992y,Maeser and Kahmnam (1991) M>/. 
Gen. Genet. 250:170-176). 

Many of these belong to fbe integrase family of recombinases 
5 (Aigos etd. EMBOJ. 5:433-440 (1986)). PeAaps flie best studied of these axe 

the Integrase/aff system fiom bacteriophage X (Landy, A. Current Opinions in 
Genetics and Devel. i:699-707 (1993)), the Cre/loxP system fiom bacteriophage 
PI (Hoess and Abremski (1990) In Nucleic Acids and Molecular Biology, vol. 4. 
Eds.: Edcstein and Lilley, Beriin-Heidelbeig: Springer-Verlag; pp. 90-109) , and 
10 the FLP/FRT system fmm the Saccharomyces cerevisiae 2 (i ciicle plasmid 

(Broach et aL CeU 2P:227-234 (1982)). 

\Vhile Aese icconibination systenis have been diaiactenzed p^^ 
organisms, die related art has only taught using recombinant DNA flanked by 
recombination sites, for in vivo recombinatioa 
15 Backman (U.S. Patent No. 4,673,640) discloses the in v/vo use of A. 

recombinase to recombine a protein produdng DNA segment by en^matic site- 
specific recombuiation using wild-type recombination sites attB and attP. 

Hasan and Szybalski (Gene 5tf:145-151 (1987)) discloses fte use of A, Int 
recombuMse in vivo for intramolecular recombination between wild type attP 
20 and attB sites vMch flank a promoter. Because the orientations of diese sites are 

inverted relative to each other, this causes an irreversible flippmg of flie promoter 
region relative to the gene of mterest 

Palazzolo et al. Gene 88:25-36 (1990), discloses phage lambda vectors 
. having bacteriophage X arms that contain restriction sites positioned outside a 
25 cloned DNA sequence and between wild-type loxP sites. Infection of £ coli cells 

that express the Cre recombinase with these phage vectors results in 
recombination between the loxP sites and the in vivo excision of the plasmid 
replicon, including the cloned cDNA. 

Posfai et al (Nucl. Acids Res. 22:2392-2398 (1994)) discloses a method 
30 for inserting into genomic DNA partial expression vectors having a selectable 

marker, flanked by two vtild-type FRT recognition sequences. FLP site-specific 
recombinase as present in the cells is used to integrate the vectors into the 
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genome at predetonuned Axes. XJoda conditions where &e leplicon is 
functional, tills cloned genomic DNA can be anq>lified. 

Bebee etaL (US. Patent No. 5,434,066) discloses tfw use of site-qiecific 
tecombinases such as Cre for DNA containii% two laxP sites is used for in vivo 
5 recombination between tiie dtes. 

Boyd (Nucl. Acids Res. 27:817-821 (1993)) discloses a method to 
facilitate the cloning of blunt-ended DNA using conditions that encourage 
inteimolecular ligation to a dephosphorylated vector that contains a wild-type 
loxP site acted upon by a Cre site-specific lecombinase present in R coli host 
lb cells. 

Wateibousec/at (PCTNo. 93/19172 andM<cfeiCi4cttfcXei. 21 (9)0265 
(1993)) disclose an to vnw metiiod viieie light and heavy chains of a particular 
antibody vireie cloned in diflferent phage vectors between laxP and loxP 511 sites 
and used to transfect new E coli cells. Cre, acting in the host cells on the two 
IS parental molecules (one plasmid, one i^e), produced four products in 

equilibrium: two dififerent comt^rates (produced by recombination at either laxP 
or loxP 511 sites), and two daughter molecules, one of viuch was tiie desired 
product 

In contrast to the other related art, Schlake & Bode (Biochemistry 
20 55:12746-12751 (1994)) discloses an in vivo method to exchange e3q)ression 

cassettes at defmed chromosomal locations, each flanked by a wild ^e and a 
spacer-muuted FRT recombination site. A double-reciprocal crossover was 
mediated in cultured mammalian cells by usmg tius FLP/FRT system for site- 
specific recombination. 
25 Transposases. The family of enzymes, the transpbsases, has also been 

used to transfer genetic infomiation between replicons. Transposons are 
structurally variable, being described as simple or compound, but typically 
encode the recombinase gene flanked by DNA sequences organized in inverted 
orientations. Integration of transposons can be random or highly specific. 
30 Representatives such as Tn7, which arc lughly site-specific, have been applied to 

the in vivo movement of DNA segments between replicons (Lucklow et al., 
J. ViroL tf7:4566-4579(1993)). 
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Devine and Boeke Nucl Acids Res. 22:3765-3772 (1994), discloses the 
construction of artificial transposons for tire insertion of DNA segments, in vitro, 
into tecipient DNA molecules. The system makes use of the integrase of yeast 
TYl vims-like particles. The DNA segment of interest is cloned, using standard 
methods, betweoi the ends of fte liansposon-like element TYl . In the presence 
of the TYl integrase, the resulting element integrates nmdomly into a second 
target DNA molecule. 

DAM doniag. The cloning of DNA segments currently occurs as a daily 
routine in many research labs and as a prerequisite step in many genetic analyses. 
Hk pmpose of diese clonings is various, however, two general puiposes can be 
considered: (1) the initial cloning of DNA fiom large DNA or RNA segments 
(chramosomes, YACs, PGR fiagments, mRNA. etc.), done in a relative faandfid 
of known vectors such as pUC, pGem. pBlueScript, and ^i) the subdoning of 
these DNA segmoits into specialized vectors for functional analysis. A great 
deal of time and effort is expended botii in tiie initial cloning of DNA segments 
andin tiie transfo' of DNA s^moils fiom tiie initial cloning vectors to the more 
specialized vectors. This transfer is called subcloning. 

The basic methods for cloning have bem known for many years and have 
changed little during that time. A typical cloning protocol is as follows: 

(1) digest the DNA of interest with one or two restriction 
en^es; 

(2) gel purify the DNA segment of interest when known; 

(3) prepare the vector by cutting wifli appropriate restriction 
enzymes, treating with alkaline phosphatase, gel purify etc., as 
iq)pK>priate; 

(4) ligate the DNA segment to vector, wifli appropriate controls 
to estimate background of uncut and self-ligated vector; 

(5) introduce the resulting vector into an £ cob' host cell; 

(6) pick selected colonies and grow small cultures overnight; 

(7) make DNA minipreps; and 

(8) analyze the isolated plasmid on agarose gels (often after 
diagnostic restriction enzyme digestions) or by PGR. 



wo 96/40724 



-5- 



PCrAJS96ft0082 



Tbe spedalized vectors used for subcloning DNA s^ments are 
fimctiaiiaUy divose. These include but are not limited to: vectors for e]q)ressing 
genes in various organisms; for r^ulating gene e}q)ression; for providing tags to 
dd in protein purification or to allow tracking of proteins in cells; for modifying 
flie cloned DNA segment (e.g., gaieraling deletions); for the syn&esis of probes 
{e.g., riboprobes); for the preparation of templates for DNA sequencing; for the 
identification of protein coding regions; for the fusion of various protein-coding 
regions; to provide large amoimts of the DNA of interest, etc. It is common diat 
a particular investigation will involve subcloning the DNA segment of inteiest 
mto several difEeient specialized vectors. 

As known in the ai^ foraple sufaclonings can be done in one day (e.£, die 
DNA segment is not large and die restriction sites are conqratible widi diose of 
die subcloning vector). However, many odier siibclonings can take several 
wedcs, especially diose involving unknown sequences, long fiagments, toxic 
genes, unsuitable placement of restriction sites, high badcgrounds, impure 
en^rmes, etc. Subcloning DNA fiagments is thus often viewed as a chore to be 
done as few times as posable. 

Several methods for fecilitating die cloning of DNA segments have been 
described, e.g., as in the following references. 

Ferguson, J., et d. Gene 16:\9\ (1981), discloses a family of vectors for 
subcloning fragments of yeast DNA. The vectors encode kanamycin resistance. 
Clones of longer yeast DNA segments can be partially digested and ligated into 
the subcloning vectors. If the original cloning vector conveys resistance to 
ampicillin, no purification is necessary prior to transformation, since the selection 
will be for kanamycin. 

Hashimoto-Gotoh, T., et al. Gene 41:125 (1986), discloses a subcloning 
vector with unique- cloning sites widiin a streptomycin sensitiidty gene; in a 
streptomycin-resistant host, only plasmids widi inserts or deletions in die 
dominant sensitivity gene vnll sur^dve streptomycin selection. 

Accordingly, traditional subcloning methods, using restriction enzymes 
and ligase, are time consuming and relatively unreliable. Considerable labor is 
emended, and if two or more days later die desired subclone can not be found 



among tiie candidate plasmids, the entire process must then be repeated wi& 
alternative conditions attempted. Although site specific recombinases have been 
used to lecombine DNA in vivo, the successM use of sudi enzymes in vitro was 
aq)ected to suffer from several problems. For example, &s ste specificities and 
efficiendes were e^qsected to differ in vitro; topologically-linked products were 
expected; and the topology of the DNA substrates and recombination proteins 
was expected to differ significantly in vitro (see. e.g., Adams et al, J. Mol. 
Biol. 226:661-13 (1992)). Reactions that could go on for many hours in vivo 
were expected to occur in significantly less time in vi6ro before the enzymes 
became inactive. Multiple DNA reconibinatioa products were expecAadi in the 
biological host used, resulting in unsatisfiictoiy reliability, specificity or 
effidency of subdomng. /n vitro tecombinatioa reactions were not ejqiected to 
be suffidenUy efficient to yield the desired levels of product 

Accordingly, tiiere is a long felt need to provide an alternative subcloning 
system that provides advantages over tiie known use of restriction enzymes and 
ligases. 

Summary of (he Invention 

The present invention provides nudeic acid, vectors and methods for 
obtaining chimeric nucleic acid using recombination protdns and engineered 
recombination sites, in vitro or in vivo. These metiiods are highly specific, rapid, 
and less labor intensive than what is disclosed or suggested m the related 
background art The improved specificity, speed and yields of the present 
invention fecilitates DNA orRNA subcloning, regulation or exchange useful for 
any related piupose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vivo insertion or modification of transcribed, 
replicated, isolated or genomic DNA or RNA 

Hie present invention relates to nucleic acids, vectors and methods for 
moving or exchanging segments of DNA using at least one engineered 
recombination site and at least one recombination protein to provide chimeric 
DNA molecules which have tiie de^red charactaistic(s) and/or DNA segment(s). 



Generally, one or more parent DNA molecules are tecombihed to give one or 
niore daughter molecules, at least one of wbich is tiie desired Product DNA 
segment or vector. The invaition thus relates to DNA, RNA, vectors and 
methods to effect the exchange and/or to select for one or more desired products. 

One embodiment of the present invention relates to a method of making 
chimeric DNA, whidi comprises 

(a) combining in vi/ro or in vivo 

0) an Insert Itonor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
i^erein flie first and second recombination sites do not rebombine with each 
other, 

00 a Vector Donor DNA molecule containing a third 
recombination ate and a fourth recombination site, vriieim the third and fourth 
recombination sites do not recombine with each otfaen and 

(iii) one or more ate specific recomUnation proteins capable 
of recombining the first and third recombinational sites and/or the second and 
fourth lecombinational sites; 

thereby allowing recombination to occur, so as to produce at least one 
Cointegrate DNA molecule, at least one desired Product DNA molecule which 
comprises said desired DNA segment, and optionally a Byproduct DNA 
molecule; and tfien, optionally, 

(b) selecting for the Product or Byproduct DNA molecule. 
Another embodiment of the present mvention relates to a kit comprisii^ 

a carrier or receptacle being compartmentalized to receive and hold therein at 
least one container, wherein a first container contains a DNA molecule 
comprising a vector having at least two recombination sites flanking a cloning 
site or a Selectable marker, as described herein. The Idt optionally fiirther 
comprises: 

(i) a second contamer containing a Vector Donor plasmid 
compriang a subdoning vector and/or a Selectable marker of which one 
or botii are flanked by one or more engineered recombination sites; and/or 



00 tUmd container containing at least one recoiorimiation protein 
vMch recognizes and is capable of tecombining at least one of said 
recombination sites. 

CMier embodiments include DNA and vectors usefiil in the mediods of tlie 
present invention. In particular. Vector Donor molecules are provided in one 
embodiment, vdierein DNA segments within the Vector Donor are separated 
either by, Q) in a circular Vector Donor, at least two recombination sites, or (ii) 
in a linear Vector Donor, at least one recombination site, where the recombination 
sites are preferably engineered to enhance specificity or efficiency of 
lecombmation. 

One Vector Donor embocUment comprises a first DNA segment and a 
second DNA segment, die first or second segment compriang a Selectable 
maricer. A second Vector Donor embodiment comprises a first DNA segment 
and a second DNA segment, tiiiB first or second DNA segment comprising a toxic 
getfe. A Aird Vector Donor embodiment comprises a first DNA segment and a 
. second DNA segment, the first or second DNA segment comprising an inactive 
fiagmeot of at least one Selectstble marker, wherein the inactive fiagment of the 
Selectable madcer is capable of reconstituting a fimctional Selectable marker 
when recombined anoss the first or second recombination site witii another 
inactive fiagment of at least one Selectable marker. 

The present recombinational cloning method possesses sevoal advantages 
over previous in vivo metiiods. Since single molecules of recombination products 
can be introduced into a biological host, propagation of the desired Product DNA 
in the absence of other DNA molecules (e.g., starting molecules, mtermediates, 
and by-products) is more readily realized. Reaction conditions can be fiwly 
adjusted in vitro to optimize enzyme activities. DNA molecules can be 
incompatible wifli the desired biological host (e.g., YACs, genomic DNA, etc.), 
can be used. Recombination proteins fiom diverse sources can be employed, 
togetiier or sequentially. 

Otiier embodiments will be evident to those of ordinary skill in the art , 
fiom die teadiings contained heam in combination witii what is known to the art 
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Brief Descryrtion of the Figures 

Figure 1 depicts one general melfaod of die present invention, wherein ifae 
starting (parent) DNA molecules can be circular or linear. The goal is to 
exchange the new subcloning vector D for the original cloning vector B. It is 
desirable in one embodiment to select for AD and t^ainst all the other molecules, 
including the Coint^rate. The square and drcle are sites of recombination: e.g., 
loxP sites, an sites, etc. For example, segment D can contain expression signals, 
hew drug markers, new origins of replication, or specialized functions for 
napping or sequencing DNA. 

Figure 2A depicts an in vitro method of recombining an Insert Donor 
plasmid Oiere, pEZC70S) wifli a Vector Donor plasmid (here, pEZC726), and 
obtaining Product DNA and Byproduct daughter molecules. The two 
iecombiniUdon sites are attP and loxP on die Vector Donor. On one segment 
defined by these sites is a kanamycin resistance gene whose promoter has been 
replaced by the tetOP operator/promoter fiom transposon 7>il0; See Sizemore 
et aL, NucL Acids Res. 7^10):287S (1990). In die absence of tet repressor 
protein, E coli RNA polymerase transcribes the kanamycm resistance gene fiom 
die tetOP. If tet repressor is ptesoit, it binds to tetOP and blocks transcription of 
the kanamycin resistance gene. The other segment of pEZC726 has the tet 
repressor gene expressed by a constitutive promoter. Thus cells transformed by 
pEZC726 are resistant to chloramphenicol, because of the diloramphenicol acetyl 
transferase gene on the same segment as tetR, but are sensitive to kanamycin. 
The recombinase-mediated reactions result in separation of the tetR gene from the 
regulated kanamycin resistance gene. This separation results in kanamycin 
resistance m cells receiving only the desired recombination products. The first 
recombination reaction is driven by the addition of the recombinase called 
Integrase. The second recombination reaction is driven by adding the 
recombinase Cre to the Cointegrate (here, pEZC7 Cointegrate). 

Figure 2B depicts a restriction map of pEZC70S. 

Figure 2C depicts a restriction map of pEZC726. 

Figure 2D depicts a restriction map of pEZC7 Cointegrate. 
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Figure 2E depicts a restriction map of Intprod. 

Figure 2F depicts a restriction map of Intbypro. 

Figure 3A depicts an in vitro method of lecombining an Insert Donor 
plasmid (here, pEZC602) wifli a Vector Donor plasmid (here, pEZC629), and 
obtaining Product (here, EZC6prod) and Byproduct (here, EZC6Bypr) daughter 
molecules. The two recombination sites are /ox/» and /ocP J77. One segment of 
pEZC629 defined by these sites is a kanamycm resistance gene whose promoter 
has been replaced by the tetOP operatoi^moter fiom transposon TnlO. In the 
absence of tet repressor protein, K coli RNA polymerase transcribes the 
kanamydn resistance gene fiom die tetOP. If tet repressor is present, h binds to 
tetOP and blodcs transcription of the kanamycin resistance gene. Hie o&er 
segment of pEZC629 has tiie tet iq>ressor gene ejqnessed by a constitutive 
promoter. Thus cells transfocmed by pEZC629 are resistant to diloramphenicol, 
because of the chloramphenicol acetyl tiansferase gene on the same segmoit as 
tetR, but are soisitive to kanamycm. Hie reactions result in sqKuation of tiie tetR 
gene fi:om the regulated kanamycin resistance gene. This separation results in 
kanamydn resistance in cells receiving the desired recombination product Hie 
first and the second recombination events are driven by the addition of the same 
recombinase, Cie. 

Figure 3B depicts a restriction map of EZC6Bypr. 

Figure 3C depicts a restriction map of EZC6prod. 

Figure 3D depicts a restriction map of pEZC602. 

Figure 3E depicts a restriction map of pEZC629. 

Figure 3F depicts a restriction map of EZC6coint 

F^re 4A depicts an application of the ih vitro method of 
recombinational cloning to subclone the chloramphenicol acetyl transferase gene 
into a vector for expression in eukaryotic cells. The Insert Donor plasmid, 
pEZC843, is comprised of the diloramphenicol acetyl tiansfoase goie of £ coli, 
cloned between hxP and attB sites such that the ImP site is positioned at flie 
5'-end of tiie gene. The Vector Donor plasmid, pEZClOOB, contains the 
cytomegalovirus eukaryotic promoter apposed to a loxP site. The supercoiled 
plasmids were combined witii lambda Integrase and Cre recombinase in vitro. 
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After incubation, competent K coU cells were tiansfonned with the 
recombinational reaction solution. Aliquots of transformations were spread on 
agar plates containing kanamycin to select for the Product molecule Qasic 
CMVPiod). 

Figure 4B depicts a restriction map of pEZC843 . 
Figure 4C depicts a restriction map of pE2^1003. 
Figure 4D depicts a restriction vosg of CMVBypro. 
Figure 4E depicts a restriction ms^ of CMVPtod. 
F^ure 4F depicts a restriction map of CMVcoint 
Figure SA depicts a vector diagram of pEZC1301 . 
Figure SB depicts a vector diagram of pEZClSOS. 
Figure SC dq>icts a vector diagram of pEZC1309. 
Figure 5D depicts a vector diagram of pEZC1313. 
Figure 5E depicts a vector diagram of pEZC1317. 
Figure 5F depicts a vector diagram of pEZC1321 . 
Figure 5G depicts a vector diagram of pEZC140S. 
Figure 5H depicts a vector diapam of pEZClS02. 
Figure 6A depicts a vector diagram of pEZC1603. 
Figure 6B depicts a vector diagram of pEZC1706. 
Figure 7A depicts a vector diagram of pEZC2901. 
Figure 7B depicts a vector diagram of pEZC2913 
Figure 7C depicts a vector diagram of pEZC3 101 . 
Figure 7D depicts a vector diagram of pEZC1802. 
Figure 8A depicts a vector diagram of pGEX-2TK. 
Figure 8B depicts a vector diagram of pEZC3501 : 
Figure 8C depicts a vector diagram of pEZC3601. 
Figure 8D depicts a vector diagram of pEZC3609. 
Figure 8E depicts a vector dis^ram of pEZC3617. 
Figure 8F depicts a vector diagram of pEZC3606. 
Figure 8G depicts a vector diagram of pEZC3613. 
Figure 8H depicts a vector diagram of pEZC362I . 
Figure 81 depicts a vector diagram of GST-CAT. 
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Figore 8J dq>icts a vector diagram of GST-phoA. 
Figure 8K depots a vector diagram of pEZC3201. 

Detailed Description of the Prtf erred Embodiments 

It is imejqiectediy discovered in the present invention that subcloning 
reactions can be provided using recombinational clonii^. Recombination cloning 
according to tiie present invoition uses DNAs, vectors and methods, in vitro and 
in vivo, for moving or exchan^ng segments of DNA molecules usmg engineered 
recombination sites and recombination proteins. These methods provide chimeric 
DNA molecules that have the desired characteristic(s) and/or DNA segment(s). 

The present invendon thus provides micldc acid, vectors and methods for 
obtaining diimeric nucleic acid using recombination proteins and engineered 
lecombiiialion sites, in vitro or in vivo. Ttese mediods are hi^y specific, rapid, 
and less labor intensive dian vrbst. is disclosed or suggested in die related 
background art The improved spedficity, speed and yields of die present 
invention fiualitates DNA or RNA subclotung, r^uUrtbn or exchange usefiil for 
any related purpose. Such purposes include in vitro recombination of DNA 
segments and in vitro or in vivo insertion or modificadon of transcribed, 
replicated, isolated or genomic DNA or RNA. 

Definitions 

In the description that follows, a number of terms used in recombinant 
DNA technology are utilized extensively. In order to provide a clear and 
consistent imderstanding of the specification and claims, including die scope to 
be given such terms, the following definitions are provided. 

Byproduct: is a daughter molecule (a new clone produced after the 
second recombination event during die recombinational cloning process) lacking 
the DNA which is desired to be subcloned. 
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Cointegrate: is at least one lecombination intennediate DNA molecule 
of the present inventioii that contains bodi parental (starting) DNA molecules. 
It will usually be circular. In some embodiments it can be linear. 

Host: is any prokaryotic or eukaiyotic organism thqt can be a recipient 
of the recombinational cloning Product A "host," as the tetin is used herein, 
includes prokaryotic or eukaiyotic organisms that can be genetically engineered. 
For examples of such hosts, see Maniatis et al., Molecular Cloning: A Laboratory 
Manual, Cold Spring Harbor Laboratory, Cold Spring Harbor, New Yoric (1982). 

Insert: is the desired DNA segment (segment A of Figure 1) vMch one 
vvishes to manipulate by flienietfabdofdie present invention. Tbe insert can have 
one or more genes. 

Insert Donor is one of die two parental DNA molecules of the present 
invendon which canies fbs Insert. The Insert Donor DNA molecule comprises 
the Insert flanked on both ades with lecombination signals. The Insert Donor 
can be linear or drcular. In one embodiment of die invention, die Insert Donor 
is a circular DNA molecule and fiirther comprises a cloning vector sequence . 
outside of die recombination signals (see Figure 1). . 

Product is one or both die desired daughter molecules comprising the ^ 
and D or £ and C sequences which are produced after the second recombination 
event during the recombinational cloning process (see Figure 1). The Product 
contains the DNA which was to be cloned or subcloned. 

Promoter: is a DNA sequence generally described as the 5'-region of a 
gene, located proximal to the start codoa The transcription of an adjacent DNA 
segment is initiated at the promoter region. A repressible promoter's rate of 
transcription decreases in response to a repressing agent. Ah inducible promoter's 
rate of transcription increases in response to an inducing agent A-«onstitutive 
promoter's rate of transcription is not specifically regulated, though it can vary 
under the influence of general metabolic conditions. 
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Recognition sequence: Recognition sequences are paiticular DNA 
sequences v*ich a protein, DNA, or RNA molecule (e.g., restriction 
endonuclease, a modificatioa me&ylase, or a recombinase) recognizes and binds. 
For example, the recognition sequence for Cre recombinase is loxP which is a 34 
base pair sequence comprised of two 13 base pair inverted repeats (serving as die 
recombinase binding sites) flanking an 8 base pair core sequence. See Figure 1 
of Sauer, B., Current Opinion in Biotechnology 5:521-527 (1994). Other 
exanqiles of recognition sequences are the ottB, attP, ottL, and attR sequences 
viiich are recognized by die recombinase en^rme X Integrase. attB is an 
approximately 25 base pair sequence contuning two 9 base pair core-type Int 
InndingsitesandaTbasepairovaiepRgjon. ottP is an qiproximately 240 base 
jtair sequence containing core-type Int binding sites and arm-type Int binding 
dtes as well as sites for aunliaiy proteins IHF. FIS, and Xis. See Lamfy, Current 
'Opinion in Biotechnology i:699-707 (1993). Such sites are also engineered 
according to the present invention to enhance methods and products. 

Recombinase: is an enzyme wiiich catalyzes the exchange of DNA 
segmmts at specific recombination sites. 

Recombinational Cloning: is a method described herein, whereby 
segments of DNA molecules are exchanged, inserted, rep^ced, substituted or 
modified, in vitro or in vivo. 

Recombination proteins: include excisive or integrative proteins, 
enzymes, co-factors or associated proteins that are involved in recombmation 
reactions involving one or more recombination sites. See, Landy (1994), infra. 

Repression cassette: is a DNA segment that contains a repressor of a 
Selectable marker present in the subcloning vector. 

Selectable marker: is a DNA segment that allows one to select for or 
against a molecule or a cell tiiat contains it, often under particular conditions. 
These markers can encode an activhy, such as, but not lunited to, production of 
RNA, peptide, or protein, or can provide a bmding site for RNA, peptides, 
proteins, inorganic and organic compounds or compositions and die like. 
Examples of Selectable markers include but are not lunited to: (1) DNA 
segments that encode products vdiich provide resistance against otherwise toxic 
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compounds {e.g. , antibiotics); (2) DNA s^ments Ibat encode products which are 
otherwise lacking in the recipient cell (e.g. , tRNA genes, auxotrophic maricers); 
(3) DNA segments that encode products which si^press the activity of a gene 
product; (4) DNA segments that encode products vMch can be readily identified 
(e.g., phenotypic markers such as ^-galactosidase, green fluorescent protem 
(GFP), and cell sur&ce proteins); (5) DNA segments that bind products which are 
otherwise detrimeatal to cell survival and/or iimction; (6) DNA segments that 
otherwise inhibit tiie activity of any of the DNA segments described in Nos. 1-S 
above (e.g., autisense oligonucleotides); (7) DNA segments that bind products 
that modify a substtate (e.g. restriction endonucleases); (8) DNA segments that 
can be used to isolate a desred molecule (e.g. spedfic protein binding sites); (9) 
DNA segmmts tiiat eacods a specific nucleotide sequence whidi can be 
otherwise non-fimctional (e.g., for PGR amplification of subpopulations of 
molecules); and/or (10) DNA segments, wUch when absmt, ^rectiy or indirectiy 
confer senntivity to particular conqx>unds. 

Selection sdiieme: is any metiiod which allows selection, eniichment, or 
identification of a deared Product or Product(s) fiom a mixture contaming tiie 
Insert Donor, Vector Donor, and/or any intermediates, (e.g. a Cointegrate) 
Byproducts. The selection schemes of one preferred embodiment have at least 
two components that are either linked or unlinked during recombinational 
cloning. One component is a Selectable marker. The other component controls 
the expression in vitro or in vivo of the Selectable marker, or survival of the cell 
harboring the plasmid carrying the Selectable marieer. Generally, this controlling 
element will be a repressor or inducer of the Selectable marker, but other means 
for controlling expression of the Selectable marker can ^x used. Whether a 
repressor or activator is used will depend on w^etho- the marker is for a positive 
or negative selection, and tiie exact arrangemoit of the various DNA segments, 
as will be readily apparent to those skilled in the art A preferred requirement is 
that tiie selection scheme results in selection of or enrichment for only one or 
more desired Products. As defined herein, to select for a DNA molecule includes 
(a) selecting or enrichmg for tiie presence of the desired DNA molecule, and (b) 
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seiecting or enriching agamst the presence of DNA molecules Ifaat are not the 
desired DNA molecule. 

In one embodiment, the selection schemes (vdiich can be carried out 
reversed) will take one of three forms, which will be discussed in terms of 
Figure 1 . The first, exemplified herein with a Selectable marker and a repressor 
therefor, selects for molecules having segment D and lacking segment C. The 
second selects against molecules having segment C and for molecules having 
segment D. Possible embodiments of die second form would have a DNA 
segment carrying a gene toxic to cells into wfaidi die in vim reaction products 
are to be introduced. AtoxicgraecanbeaDNAthatisaqpressedasatoxicgene 
product (a toxic protein or KNA), or can be toxic in and of itself. (In the latter 
case, die toxic gene is understood to cany its classical defimtion of "heritable 

Examples of such toxic gene products are well known.in die art, and 
include, but are not limited to, restriction endonucleases (e.g., DptS) and genes 
that kill hosts in the absence of a suppressing fiinction, e.g., kicB. A toxic gene 
can alternatively be selectable in v/(n>, e.g., a restriction site. 

In the second form, segment D carries a Selectable marker. Tbe toxic 
geae would eliminate transfonnants harboring the Vector Donor, Co integrate, and 
Byproduct molecules, vAiHc the Selectable marker can be used to select for cells 
containing the Product and against cells harboring only the Insert Donor. 

The third form selects for cells that have both segments /I and Z> in on 
the same molecule, but not for cells that have both segmoits in trems on different 
molecules. This could be oiibodied by a Selectable marker that is split into two 
inactive fragments, one each on segments AmdD. 

The firagments are so arranged relative to the recombination sites that 
when the segments are brought togedi«r by the recombination event, tiiey 
reconstitute a fimctional Selectable marker. For example, the recombinational 
event can link a promoter widi a structairal gene, can link two firagments of a 
structural gene, or can link genes that encode a heterodimeric gene product 
needed for survival, or can link portions of a replicoa 
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Site-specific lecombinase: is a type ofrecombinasewluch typically has 
at least the following four activities: (1) recognition of one or two specific DNA 
sequences; (2) cleavage of said DNA sequence or sequences; (3) DNA 
topoisomerase activity involved in strand exdiange; and (4) DNA ligase activity 
to reseal the cleaved strands of DNA. See Sauer, B., Current Opinions in 
Biotechnology 5:521-527 (1994). Conservative site-specific recombination is 
distinguished finm homologous recombination and tranq>osition by a high degree 
of specificity for both partners. The strand exchange medianism involves the 
cleavage and rejoining of specific DNA sequences in the absence of DNA 
synfliesis (Landy, A. {\9V9) Am. Rev. Biochem. J&913-949). 

Subclomng vector is a cloning vector comprising a circular or linear 
DNA molecule vUcb includes an qipippriate leplicon. In the presoit invendon, 
the subdoning vector (segment D in Figure 1) can also contain fimctional and/or 
reguhttoiy elements that are desired to be incorporated into the final ptoduct to 
act iqx>n or wiA the cloned DNA Insert (segaoEntA in Figure 1). The subcloning 
vector can also contain a Selectable niaika-(contauied in segment C in Figure 1). 

Vector, is a DNA fliat pro^des a usefiil biological or biochemical 
property to an Insert Examples include plasmids, phages, and other DNA 
sequences wiiich are able to replicate or be replicated in vitro or in a host cell, or 
to convey a desired DNA segment to a desired location within a host cell. A 
Vector can have one or more restriction endonuclease recognition sites at whidi 
the DNA sequences can be cut in a deteiminable fashion widiout loss of an 
esseintial biological fiinction of the vector, and into which a DNA fragment can 
be q)liced in order to bring about its rqjlicadon and cloning. Vectors can further 
provide primer sites, e.g., for PCR, transcriptional and/or translational initiation 
and/or regulation sites, recombinational signals, replicons. Selectable markers, 
etc. Clearly, me&ods of inserting a desired DNA fiagment which do not require 
tiie use of homologous recombmation or restriction em^es (such as, but not 
limited to, UDG cloning of PCR fragments (U.S. Patent No. 5^34,575, entirely 
incorporated herein by reference), T A cloning, and the like) can also be applied 
to clone a fiagment of DNA- into a cloning vector to be used according to the 
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piesent invention. The cloning vector can fiutfaer contain a Selectable marker 
suitable for use in the identification of cells transformed vnSi the cloning vector. 

Vector Donor: is one of the two parental DNA molecules of flie present 
invention vMch carries the DNA segments encoding tbe DNA vector which is to 
become part of the desired Product The Vector Donor comprises a subcloning 
vector D (or it can be called the cloning vector if the Insert Donor does not 
already contain a doning vector) and a segmoit C flanked by lecombinatioa sites 
(see Figure 1). S^ments C and/or D can contain elemoits ttiat contribute to 
selection for die desiied Product daughter molecule, as described above for 
selection sdiemes. The recombination signals can be die same or difEoent, and 
can be acted upon by die same or different recombinases. In addition, the Vector 
Donor can be linear or circular. 

Description 

One general scheme for an in vio-o or iR vm> method of tte invention is 
shown in Figure 1, vAere die Insert Donor and die Vector Donor can be rather 
circular or linear DNA, but is shown as circular. Vector D is exdianged for die 
original cloning vector A. It is desirable to sdect for the daughter vector 
containing elements ^ and I> and agmnst oflier molecules, including one or more 
Cointegnite(s). The square and circle are different sets of recombination sites 
(e.g., lox sites or att sites). Segment i4 or I> can contain at least one Selection 
Marker, e3q)ression signals, origins of replication, or specialized functions for 
detecting, selecting, expressing, mapping or sequracing DNA, where D is used 
in this example. 

Examples of desired DNA segments diat can be part of Element y4 or D 
include, but are not limited to, PCR products, large DNA segments, genomic 
clones or fiagments, cDNA clones, functional elements, etc., and genes or partial 
genes, \i*ich encode usefiil nucleic acids or proteins. Moreover, the 
rcconibinational cloning of die present invention can be used to make ex vivo and 
in vivo gene transfer vehicles for {notein e]q>ressi6n and/or gene therapy. 
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In Figure 1, die sch^e provides tbe desired Product as containing 
vectors DaadA,as follows. The Insert Donor (containing ^4 and 5) is first 
recombined at the square recombination sites by recombination proteins, witii the 
Vector Donor (containing C and D), to form a Co-integrate having each of A-D- 
C-B. Next, recombination occurs at the circle recombination sites to form 
Product DNA (A and D) and Byproduct DNA (C and B). However, if desired, 
two or more different Co-int^rates can be formed to generate two or more 
Products. 

In one embodiment of die present in vitro or in vivo lecombinational 
clonmg mefliod, a method for selecting at least one desired Product DNA is 
provided. This can be imderstood by consideration of die m^ of plasmid 
pEZC726 denoted in Figure 2. Hie two exen^bryiecombmadon sites are oftP 
mAhxP. On one segment defined by dieseates is a kanamydn resistance gene 
whose promoter has heesa replaced by die tetOP opoator/pronioter from 
transposon 2>ilO. In die absoice of tet repressor protdn, K coli RNA polymerase 
transcribes the kanamycin resistance gene finm the tetOP. If tet rqiressor is 
present, it binds to tetOP and blocks transc^ition of the kanamydn resistance 
gene. The odier segment of pEZC726 has the tet repressor gene eiqiressed by a 
constitutive promoter. Thus cells transf(»med by pEZC726 are resistant to 
chloramphenicol, because of the chloramphraiicol acetyl transferase gene on the 
same segment as tetR, but are sensitive to kanamycin. The recombination 
reactions result in separation of the tetR gene from the r^;ulated kanamycin 
resistance gene. This separation results mkanamydn resistance in cells receiving 
the desired recombination Product 

Two different sets of plasmids were constructed to demonstrate the 
in vitro method. One set, for use with Cre recombinase only (cloning vector 602 
and subcloning vector 629 (Figure 3)) contained loxP and loxP 511 sites. A 
second set, for use with Cre and integrase (cloning vector 705 and subcloning 
vector 726 (Figure 2)) contained UaP and att sites. The efficiency of production 
of the desued daughter plasmid was about 60 fold higher using both enzymes 
than using Cre alone. Nineteen of twenty four colonies fiom the Cre-only 
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leacdon contained the desired product, vMe tfaiity eight of thiity eig^t colonies 
fixnn die integrase plus Cie reaction contdned Ae desiied product plasmid. 

Other Selection Schemes A variety ofselectionsdienies can be used that 
are known in the ait as they can suit a particular purpose for which the 
recombinational cloning is carried out Depending yxpon individual preferences 
and needs, a number of dififeient types of selection schemes can be used in the 
recombinational cloning medwdofdie present invention. The skilled artisan can 
talK advantage of the availability of the maiQr DNA segments or methods for 
maldnglfaem and die difbentmediodsofselectionfliat are routinely used in the 
ait Sudi DNA segments include but are not limited to those which encodes an 
activity sudi as. but not limited to, production of RNA, peptide, or protem, or 
providing a binding ate for such RNA, peptide, or protdn. Examples of DNA 
molecules used in devising a selection sdieme are given above, imder the 
definition of "selection scheme" 

Additional examples include but are not limited to: 

CO GenaationofnewprimBratesfi>rPCR(e.£:,juxtip>sitionoftwo 
DNA sequences that were not previously juxtaposed); 

00 Inclusion of a DNA sequence acted upon by a restriction 
endonuclease or other DNA modifying en^me, chemical, 
ribozyme, etc.; 

(iiO Inclusion of a DNA sequence recognized by a DNA binding 
protein, RNA, DNA, chemical, etc.) (e.^., for use as an aSinity 
tag for selecting for or excluding from a population) (Davis, Nucl. 
Acids Res. 2^:702-706(1996); J. Virol. 69: 8027-8034(1995)); 

Ov) In vitro selection of RNA Ugands for flie ribosomal L22 protein 
associated wifli Epstein-Ban vitus-e^qnessed RNA by using 
randomized and cDNA-derived RNA librmies; 

(vi) The positiomngoffimctional elements whose activity requires a 
q)edfic oiientation or juxtaposition (e.g., (a) arecombination site 
which reacts poorly in trans, but \riien placed in cis, in the 
presence of die qipropriate protdns, results in recomUnation that 
desh»ys certain popdations of molecules; (e.g., reconstitution of 
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a promoter sequence that allows in viUv RNA synthesis). The 
BNA can be used diixc&y, or can be reverse transcribed to obtain 
tiie desired DNA construct; 

(vii) Selection of the desired product by size {e.g., fractionation) or 
ttOasx phyacal property of the molecuIe(s): and 

(viiO Inclusion of a DNA sequence required for a specific modification 
mediyMon) that allows its identification. 

After finmation of the Product and Byproduct in the method of the present 
invention, tiie sdection st^ can be canied out eidia in W/>v or iR vivo depending 
iqjon tiie particular sdection scheme ^diich has been optionally devised in the 
particular recombinational clomng procedure. 

Forexanq>le, an m vitro metiiod of sdection can be devised fiv the Insert 
DonoraiidVectorDonorDNAinolecules. Such scheme can involve engineering 
a rare restriction site in tiie starting circular vectors in such a way that after the 
reoomlnnation events tiie rare cutting ates Old 19 in the Byproduct Ifeice,v<dien 
tiie restriction en^me wdiich bmds and cuts at the rare restriction site is added to 
tiie reaction mixture in vitro, all of the DNA molecules carrymg tiie rare cutting 
site, ie., the starting DNA molecules, tiie Cointegrate, and tiie Byproduct, will be 
cut and rendered nonreplicable in the intended host cell. For example, cutting 
sites in segments 5 and C (see Figure 1) can be used to select against all 
molecules except flie Product Alternatively, only a cutting site in C is needed if 
one is able to select for segment D, e.g., by a drug resistance gene not found on B. 

Similarly, an in vitro selection metiiod can be devised w4ien dealing witii 
linear DNA molecules. DNA sequences complementary to a PCR primer 
sequence can be so engineered that they are transferred, through the 
recombinational cloning metiiod, only to the Produa molecule.-. After the 
reactions are completed, tiie iqipnqniate primers are added to tiie reaction solution 
and tiie sample is subjected to PCR. Hence, all or part of tiie Product molecule 
is amplified. 

Otiier^R vivo selection schemes can be used with a variety off. co/i cell 
lines. One is to put a repressor gene on one segment of tiie subcloning plasmid, 
and a drug marker controlled by that repressor on the other segment of tiie same 
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plasmid. Anotho- is to put a killer gene on segment C of the subcloningplasmid 
(Figure 1). Of course a way must exist for growing such a plasmid, i.e., there 
inust exist circumstances under vUch the killer gene will not kill. There are a 
number of these genes known which require particular strains of £ coli. One 
such scheme is to use die restriction enqrme DprA, which will not cleave unless 
its recognition sequence GATC is mediylated. Many popular common E. coli 
strsdns mediyiate GATC sequences, but there are mutants in which cloned Dpnl 
csm be aq)iessed without harm. 

Of course analogous selection schemes can be devised for other host 
organisms. For example, die te/repressoi/operator of 7r70 has been adapted to 
control gene e:qxesaon in eukaiyotes (Gossen, M., and Bujard, H.. Proc NatL 
Acad. ScL USA 8^.S547-SSS1 (1992)). Thus (bs same control of drug resistance 
by the tet rqnessor exemplified hoein can be applied to select for Product in 
eukatyotic cells. 

Recombina&m Protons 

In flie present mvention, fl)e exchange of DNA segments is achieved by 
the use of recombination proteins, including recombinases and associated 
oo-£ictors and proteins. Various recombination proteins are described m the art 
Examples of such recombinases include: 

Cre: A protein from bacteriophage PI (Abremski and Hoess, J. Biol. 
Chenu 2J9(3).1509-1514 (1984)) catalyzes tiie exchange (i.e., causes 
recombination) between 34 bp DNA sequences called loxP (locus of crossover) 
sites {See Hoess et al, Nucl. Acids Res. ;¥(5):2287 (1986)). Cre is available 
commerdally (Novagen, Catalog No. 69247-1). Recombination mediated by Cre 
is fieely tevrasible. From diennodynamic considerations it is not surprising that 
Cre-mediated int^ration (recombination between two molecules to form one 
molecule) is mudi less efBcient tiian Cre-mediated excision (recomlnnation 
between two loxP sites in flie same molecule to form two daughter molecules). 
Cre works in simple buffers with eidier magnesium or spermidine as a co&ctor, 
as is well known in tiie art. The DNA substrates can be eidier linear or 
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supercoOed. A number of mutant loxP sites have been desraibed (Hoess ef aL , 
supra). One of fliese, loxP 511, lecombines wifli another loxP 511 site, but wll 
ndt recombine with a loxP site. 

Integrase: A protein from bacteriophage lambda that mediates the 
integration of the lambda genome into the E. coli chromosome. The 
bacteriophage X Int recombmational proteins promote irreversible recombination 
between its substrate att sites as part of the formation or induction of a lysogenic 
state. Reversibility of flie recombination reactions rnults from two independent 
pathways for integrative and excisive recombination. Each paAway uses a 
liiiique, but overli^mg, set of the IS protein binding ates diat comprise att site 
DNAs. Cooperative and competitive interactions invoiraig fotn: proteins (Int, 
fis, IHF and FIS) determine the direction of recombination. 

Integrative lecomlnnation involves the Int and IHF proteins and sites attP 
(240 bp) and attB (2S bp). Recombination results in the foimation of two new 
. sitttraffiLandoff/L Exdsive recombination requires Int. IHF, and Xis, and sites 
ofll, and otrR to generate aflP and aa£. Under certam conditions, FIS stimuUtes 
excisive recombination. In adttition to tiiese normal reactions, it should be 
appreciated that attP and attB, when placed on the same molecule, can promote 
exdsive recombination to generate two excision products, one with attL and one 
with attR. Similarly, intermolecular recombination between molecules 
containing attL and attR, in the presence of Int, IHF and Xis, can result in 
integrative recombination and the generation attP and attB. Hence, by flanking 
DNA segments with appropriate combinations of engmeered att sites, m the 
presence of the ^propriate recombination proteins, one can direct excisive or 
integrative recombination, as reverse reactions of each other. 

Each of the att sites contains a IS bp core sequence; individual sequence 
elements of fimctional significance lie wifliin, outside, and across tiie boundaries 
of this common core (Landy, A., Ann. Rev. Biochem. 55:913 (1989)). EfiBcient 
recombmation between the various att sites requires that the sequence of die 
coitral common region be identical between the leoombining partners, however, 
the exact sequence is now found to be modifiable. Consequentiy, derivatives of 
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fte att site wilfa dianges vd&in &e core are now discovered to recombine as least 
as efBciently as tiie native core sequences. 

Integrase acts to recombine the attP ate on bacteriophage lambda (about 
240 bp) with the attB site on the E. coli genome (about 25 bp) (Weisberg, R.A. 
and Landy, A. in Lambda Zf, p. 21 1 (1983). Cold Spring Harbor Laboratory)), to 
produce the integrated lambda genome flanked by attL (about 100 bp) and attR 
(about 160 bp) sites. In the absence of Xis (sec below), this reaction is essentially 
irreversible. The integration reaction mediated by integrase and IHF works 
in vitro, with simple buffer containing spermidine. Int^rase can be obtained as 
desaibedbyNadi,HA,Mc<fto&o/£ngwK>/o|gyJ(W-^ (1983). IHF can 
beobtained as described by Filutowicz, M., etaL, Gene 147:149-150 (1994). 

In fbs presence of Ifae X protein Xis (excise) integrase catalyzes the 
reaction of attR and attL to form attP and attB, ie., it promotes fta reverse of the 
reaction described above. Tliis reaction can also be applied in the present 
invention. 

(Mter Recombination Systems. Numoous recombination systems fix>m 
various otganisQis can also be used, based on the teachmg and guidance provided 
herein. See. e.g.. Hoess et al.. Nucleic Acids Research 7<6):2287 (1986); 
Abremski et a!., J. Biol. Chem.26l(l):39l (1986); Can^jbell, J. Bacterial 
/7-/(23):7495 (1992); Qian el al., J. Biol. Chem. 2tf7(ll):7794 (1992); Araki 
et al., J. Mol. Biol 225(l):25 (1992)). Many of these belong to the integrase 
family of recombinases (Argos et al. EMBOJ. 5:433-440 (1986)). Perhaps the 
best studied of these are the Integrase/aff system from bacteriophage A. (Landy, 
A. (1993) Current Opinions in Genetics and DeveL 5:699-707), the CxdloxP 
system from bacteriophage PI (Hoess and Abremski {Ji99Q)\n Nucleic Acids and 
Molecular Biology, vol. 4. Eds.: Eckstein and Lilley, Berlin-fleidelberg: 
Springer-Verlag; pp. 90-109) , and the FLP/FRT system from the Saccharomyces 
cerewsiae 2 n circle plasmid (Broach et aL Cell 29:227-234 (1982)). 

Members of a second fiimlly of site-specific recombinases, the resolvase 
(.e.g., y6, Tn3 resolvase, Hin, Gin, and CSn) are also known. Members of 
this highly related £unily of recombinases are typically constrained to 
intramolecular reactions {e.g., inversions and excisions) and can require host- 
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encoded &ctois. Mutants have been isolated Ibat leUeve some of the 
requironents for host fectois (Maeser and Kahnmann (1991) Mol. Gen. Genet. 
230: 170-176), as well as some of the constraints of mtramolecular recombination. 

OOier site-specific lecombinases amilar to 1 M and sinular to PI Cre can 
be substituted for Int and Cre. Such lecombinases are known. In mai^ cases the 
puiification of such other recombinases has been described in the ait In cases 
. when they are not known, ceil extracts can be used or the enzymes can be 
partially purified using procedures described for Cre and Int 

While Cre and Int are described in detail for reasons of example, many 
related recombinase systems exist and their application to die described invention 
is also provided according to the present invention. The integrase family of site^ 
spedfic recombinases can be used to pro\dde alternative recombination proteins 
andrecomlnnation stes for the present invention, as site-specific recombination 
proteins encoded by bacteriophage lambda, phi 80, P22, P2, 186, P4 andPl. This 
groiqi of proteins esdiibits an unesqpectedly large diversity of sequences. Despite 
tbis diversity, all of the lecombimises can be aligned in dieir C-tenninal halves. 

A 40-re»due region near flie C temmnis is particulariy well conserved in 
all die proteins and is homologous to a region near die C terminus of die yeast 2 
mu plasmid Fip proteiit Three positions are pofecdy conserved within this 
family: faistidine, aiginine and tyrosine are found at respective alignment 
positions 396, 399 and 433 witiiin tiie well-conserved C-teiminal region. These 
residues contribute to die active site of this fiunily of recombinases, and suggest 
tiiat tyiosine-433 forms a transient covalent linkage to DNA during strand 
cleavage and rejoining. See. e.g., Argos, P. etal.. EMBOJ. 5:433-40 (1986). 

Alternatively, IS231 and other Bacillus thuriiigiensis transposable 
elements could be used as recombination proteins and recombination sites. 
Bacillus thuringiensis is an entomopadiogenic bacterium whose toxicity is due 
to the presence in the sporangia of delta-endotoxin crystals active against 
agricultural pests and vectors of human and animal diseases. Most of the genes 
coding for fliese toxin proteins are plasmid-bome and are generally structurally 
associated vntii insertion sequences (IS23 1 , IS232, IS240, ISBTl and ISBT2) and 
transposons (Tn4430 and Tn5401). Several of tiiese mobile elements have been 
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shown to be active and participate in tlie crystal goie mobility, thereby 
oontiibudng to &e variatioa of bacterial iaaaaty. 

Structural analysis offlieiso-IS231 dements indicatesOatthey are related 
to ISllSl from Clostridium perfringem and distantly related to IS4 and ISI86 
firom Escherichia coli. Like the other IS4 femily members, they contain a 
conserved transposase-integrase motif found in other IS &milies and retroviruses. 

Moreover, fimctional data gathered from IS231A in Escherichia coli 
indicate a non-replicadve mode of transposition, wth a preference for specific 
targets. Similar results were also obtained in Bacillvs subtilis and B. 
thurir^enHs. See, e.g., MahiUon, J.etaL, Genelica 93:13-26 (1994); Canq)bdl, 
J Aic/er/ot 7495-7499 (1992). 

The amiDunt of cecombinase wfaidi is added to drive the recombination 
reaction can be determined by using known assays. Specifically, titration assay 
is used to determine the appropriate amount of a purified recombinase en^me, 
or the q)propriate amount of an extract 

Engineered Xecombinatian Sites. The above recomlnnases and 
corresponding recombinase sites are suitable for use in recombination cloning 
according to the present invention. However, wild-type recombination sites 
contain sequences that reduce the efficiency or specificity of recombination 
reactions as ai^lied in methods of the present invention. For example, multiple 
stop codons in attB, attR, attP, attL and ioxP recombination sites occur in 
multiple reading frames on both strands, so recombination efficiencies are 
reducted, e.g., vAick the coding sequence must cross the recombmation sites, 
(only one reading frame is available on each strand of loxP and attB sites) or 
impossible (in attP, attR or attL). 

Accordingly, the present invention also provides engineered 
recombination sites that overcome these problems. For example, att sites can be 
engineoed to have one or multiple mutations to enhance specificity or efiBciency 
of the recombination reaction and the properties of Product DNAs {e.g., attl, attZ, 
and att3 sites); to decrease reverse reaction (e.g., removing PI and HI fiom attB). 
The testing of these mutants determines which mutants yield sufficient 
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lecombinational activity to be suitable for Fecombination subcloning according 
to the present invention. 

Mutations can therefore be introduced into recombination sites for 
enhancing site specific recombination. Such mutations include, but are not 
limited to: recombination ates widiout translation stop codons that aUow fusion 
proteins to be encoded; recombination sites recognized by the same proteins but 
differing in base sequence such that they react largely or exclusively with their 
homologous partners allow multiple reactions to be contemplated. Which 
particular reactions take place can be specified by vMch particular partners are 
present in the reaction mixture. For example, a tripartite protein fiiaon could be 
accomplished with parental plasmids containing recombination sites attRl and 
attR2; attLl and atfL3; and/or attR3 and 9tiL2. 

These are weQ known procedures for introducing specific mutations into 
nuddc add sequences. A number of diese are described in Ausubel, F.M. et al.. 
Current Protocols in Molecular Biology, Wiley Intersdence, New York (1989- 
1996). Mutations can be designed into oligonucleotides, v^ch can be used to 
mo^fy existing cloned sequraces, or in anq>lification reactions. Random 
mutagenesas can also be employed if appropriate selection metiiods are available 
to isolate the desired mutant DNA or RNA. The presence of the desired 
mutations can be confirmed by sequencing the nucleic acid by well-known 
methods. 

The following non-limiting methods can be used to engineer a core region 
of a given recombination site to provide mutated sites suitable for use in the 
present invention: 

1. By recombination of two parental DNA sequenc^ by site-specific (e.g. 

&X!iL and attR to give attB) or other (e.g. homologous) 
recombination mechanisms. The DNA parental DNA segments 
containing one or more base alterations resulting in the final core 
sequence; 

2. By mutation or mutagenesis (site-specific, PGR, random, spontaneous, 

etc) diiectiy of the desued core sequence; 



-28- 

3. By mutagdiesis (ate-spedfic. PGR, landdm, spontanteous, ietc) of . 

parental ONA sequences, vAath aie lecombined to generate a 
desiied core sequence; and 

4. By reverse transcrq)tion of an RNA encoding the desired cote 

sequence. 

The fimctionality of the mutant recomlnnation sites can be demonstrated 
in ways that depend on the particular diaracteristic that is desired For example, 
the lade of translation stop codons in a recombination site can be demonstrated 
by expressing the appropriate fusion proteins. Specificity of recombination 
between homologous partners can be demonstrated by introducing the appropriate 
molecules into in vitro reactions, and assaying for recombination products as 
described hodn or known in the ait Odierdeared mutations in recombination 
sites mi^ include the presence or absoice of restriction sites, translation or 
tiansaiption start agnals, proton binding dies, and other known functionalities 
of nucleic acid base sequences. Genetic selection schemes for particular 
fimctional attributes in tiie recombination sites can be used according to known 
metiiod steps. For example, the modification of sites to provide (fiom a pair of 
sites tiiat do not mteract) partnos tibat do interact could be achieved by requiring 
deletion, via recombination betwem the sites, of a DNA sequence encocUng a 
toxic substance. Similarly, selection for sites that rranove translation stop 
sequences, the presence or absence of protein binding sites, etc., can be easily 
devised by those skilled in the art 

Accordingly, the present mvention provides a nucleic acid molecule, 
comprising at least one DNA segment having at least two engineered 
recombination sites flanking a Selectable marker and/or a desired DNA segment, 
wiierein at least one of said recombination sites comprises a core region having 
at least one engineered mutation that enhances recombination in vitro in the 
formation of a Cointegrate DNA or a Product DNA. 

The nucleic add molecule can have at least one mutation that confers at 
least one enhancement of said recombination, said enhancement selected &om the 
group consisting of substantially 0) favoring excisive integration; (it) fevoring 
excisive recombination; Oi) relie>ang the requirement for host factors; (iii) 
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increasing the eiGSdeocy of said CointBgrate DNA or Product DNA foimadoi^ . 
and Civ) increasing the specifidty of said Cointegtate DNA or Product DNA 
formation. 

The nucleic add molecule preferably comprises at least one 
5 lecrambinadon site derived fiomattB,attP,attL or attR, More preferably the att 

ate is selected from attl, att2, or att3, as described herein. 

In a preferred embodiment, the core regjon comprises a DNA sequence 
selected fioni tiie gtoiq> consisting of: 

(a) RkYCWGCTriYKTRTACNAASTSGB(m-att)(SEQIDNO:l); 
10 (b) AGCCWGCTTTYKTRTACNAACTSGB (m-atffl) (SEQ ID 

N0:2); 

(c) GTTCAGCnTCKlRTACNAACTSGB (m-attR) (SEQ ID 

N0:3); 

(d) AGCCWGCTTTCKTRTACNAAGTSGB (m-attt.) (SEQ ID 

15 N0:4); 

(e) GTTCAGCTTTYKTRTACNAAGTSGB(m-attPl) (SEQ ID 

N0:5); 

or a corresponding or complementary DNA or RNA sequence, wherein R=A or 

G; K=G or T/U; Y=C or T/U; W=A or TAJ; N=A or C or G or T/U; S=Cor G; 
20 and B=C or G or T/U, as presented in 37 CJ'.R. §1.822, vMch is entirely 

incorporated herdn by reference, wherein the core region does not contain a stop 

codon in one or more reatting fiames. 

TIk core region also preferably comprises a DNA sequence sdected from 

the group consisting of. 
25 (a) , AGCCTGCTnnTGTACAAACTTGT(attBl)(SEQIDN0:6); 

(b) AGCCTGCnTCTTGTACAAACTTGT(attB2)(SEQIDNO:7); 

(c) ACCCAGCrrrCnTGTACAAAeTTGT (attBB) (SEQ ID N0:8); 
(<^ GTTC7lGCITrmGTACAAACTTGT(attRl)(SEQIDN0:9); 
(e) GTrCAGCriTCTTGTACAAACTTGT(attK2)(SEQIDNO:10); 

30 (0 GTTCAGCTTrCTfGTACAAAGTrGG (attR3) (SEQ ID 

N0:11); 
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(g) AGCCTGCTTTTTTGTACAAAGTTGG (attLl) (SEQ ID 
N0:12); 

(h) AGCCTGCnTCTTGTACAAAGTTGG (attL2) (SEQ ID 
N0:13): 

CO ACCCAGCnTCTTGTACAAAGTTGG (attL3) (SEQ ID 
N0:14); 

0) GTTCAGCrTTmGTACAAAGTrGG(attPl)(SEQIDN0:l5); 

(k) GTTCAGCTTTCrrGTACAAAGTTGG (altP2J3) (SEQ ED 
N0:i6); or a corresponding or complementary DNA or RNA sequence. 

The ptesent inv&ntion thus also provides a mediod for making a nucleic 
add molecule, conqsising providing a nuddc add molecule haraig at least one 
en^neeied recombination dte conqirising at least one DNA sequence ha^nng at 
least 80-99% homology (or any range or value therein) to at least one of SEQ ID 
N0S:1-16, or any suitable recombination site, or which hybridizes under 
stnngoit conditions thereto, as known in die art 

Clearly, there are various types and pemiutadons of such well-known 
in vitro and in vivo selection methods, each of which are not desaibed herdn for 
the sake of brevity. However, such variations and permutations are contemplated 
and considered to be the different embodiments of the present invention. 

It is important to note that as a result of the preferred embodiment being 
in vitro recombination reactions, non-biological molecules such as PGR products 
can be manipulated via the present recombinational cloning method. In one 
example, it is possible to clone linear molecules into circular vectors. 

There are a number of applications for the present invention. These uses 
include, but are not limited to, changing vectors, apposing promoters with genes, 
constructing genes for fiision proteins, changing copy number, changing 
replicons, clomng mto phages, and cloiung, e.g., PGR products (widi an ottB site 
at one end and a loxP site at the other eaS), genomic DNAs, and cDNAs. 
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The Mowing exan^les are intanled to fuTlba iUusbate 
mtbodimrats of the invendon and are not intended to be linutine in tiatuie. 

Example 

The present recombinadona! cloning method accomplishes the exchange 
of nucleic acid segments to tender something useM to the user, sudi as a chaise 
of cloning vectors. These segments must be flanked on bofli sides by 
recombination signals that are in the proper orientation with respect to one 
anodier. In the examples below the two parental nucleic acid molecules {e.g., 
plasmids) are called the Insert Donor and flie Vector Donor. The Insert Donor 
contmns a segment that will become joined to a new vector contributed by the 
Vector Donor. The recombination intamediate(s) diat contain(s) both startmg 
molecules is called the Cointegiate(s). The second lecomUnaticm event produces 
two daughter molecules, called the Product (the deared new clone) and the 
Byproduct 

Bikers 

Various known buffers can be used in the reactions of the present 
invention. For restriction enzymes, it is advisable to use the buffers 
recommended by the manu&cturer. Alternative buffers can be rea^y found in 
&e literature or can be devised by those of ordinary skill in the art 

Examples 1-3. One exemplary buffer for lambda integrase is comprised 
of 50 mM Tris-HCl, at pH 7.5-7.8, 70 mM KCl, 5 mM spermidine, 0.5 mM 
EDTA, and 0.25 mg/ml bovine serum albumin, and optionally, 10% glycerol. 

One preferred buffer for PI Cre recombinase is comprised of 50 mM 
Tris-HCl at pH 7.5, 33 mM NaCl, 5 mM spermidine, and 0.5 mg/ml bovine 
serum albumin. 

The buffer for other site-specific recombinases which are similar to 
lambda Int and PI Cre are either known in the art or can be determined 
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empiricaUy by the ^ed artisans, particulariy in Ugjit of flie abovcKlescribcd 

Eicample I: RecombiuaUoma Clonittg Using CreandCnSc. Int 

Two pairs of plasmids were constntcted to do the in vitro lecombinadonal 
cloning method in two different ways. One pair, pEZC705 and pEZC726 
(Figure 2A), was constructed with loxP and att sites, to be used wifli Cre and 
A, integrase. The othapair, pEZC602 and pEZC629 (Figure 3 A), contained the 
haP (wUd type) ate for Cre, and a second mutant lax site, loxP 511, v*ich 
differs from loxP in one base (out of 34 total). The minimum requirement for 
leoombinational cloning of tiie present invention is two lecombhiation ates in 
each plasmid, in generalXand F, andX* and F. Recombinational cloning takes 
place if difaer or bodi types of site can recombine to form a Comt^rate {e.g. X 
and X), and if either or both (but necessarily a site different fiom die type 
forming the Cointegrate) can recombine to excise the Product and Byproduct 
plasmids fiom the Coint^rate {e.g Y and F). It is important diat die 
recombination sites on the same plasmid do not recombine. It was found that the 
presoitrecombinational cloning could be done with Cre alone. . 

Cre^nly 

Two plasmids were constructed to demonstrate this conception {see 
Figure 3 A). pEZC629 was the Vector Donor plasmid. It contained a constitutive 
drug marker (chloramphenicol resistance), an origin of replication, loxP and 
loxP 511 sites, a conditional drug marker (kanamycin resistance whose 
expression is controlled by the operator/promoter of the tetracycline resistance 
operon of transposon TnlO), and a constitutively expressed gene for the tet 
repressor protein, tetR. E. colt cells containing pEZC629 were resistant to 
chloramphenicol at 30 fig/ml, but sensitive to kanamycin at 100 jig/ml. pEZC602 
was the Insert Donor plasmid, which contained a different drug maiker 
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(ampicUlin resistance), an origin, and loxP and loxP 511 sites flankiiig a multiple 
cloning site. 

This experiment was comprised of two parts as follows: 

Part I: About 75 ng each of pEZC602 and pEZC629 were mixed in a 

5 total volume of 30 nl of Cre buffer (50 mMTris-HClpH 7.5, 33 mMNaCl, 

5 mM spermidine-HCl, 500 ng/nA bo-wne serum albumin). Two 10 (il aliquots 
were transferred to new tubes. One tube received 0.5 fil of Cre protein (approx. 
4 units per nl; partially purified according to Abrranski and Hoess, J. Biol. Chem. 
25P:1509 (1984)). Both tubes were incubated at 37°C for 30 minutes, then 70'C 

10 for 10 nunutes. Aliquots of each leaction were diluted and transformed into 

DH5a. Following oipiession, aliquots were plated on 30 (igAnl 
diloranqihaucol; 100 (ig/ml anqucillin plus 200 }igM meflndnii^ or 100 (tg/knl 
kanan^cin. Residis: See Table 1. The reaction without Cre gave l.llxlO' 
anqudllin reastant colonies (fiom &e Insert Donor plasnud pEZC602); 7.8x10^ 

15 cMoramphenicol resistant colonies (fiom die Vector Donor plasmidp£ZC629): 

and 140 kanamydn resistant colonies (bad^tound). llie rea^ 
gave 7.5x10^ anopicillin resistant colomes (finm the Insert Donor plasmid 
pEZC602); 6.1x10* diloramphenicol ledstant colonies (fiom the Vector Donor 
plasmid pEZC629); and 760 kanamydn resistant colomes (mixture of 

20 background colonies and colonies fiom the lecombinational clorung Product 

plasmid). Analysis: Because the number of colonies on the kanamycin plates 
was mudi bi^ier in the presoice of Ore, many or most of Ifaem were predicted to 
contain the d^red Product plasmid. 



Table 1 


Enzyme 


Ampicillin 


Chloramphenicol 


Kanamycin 


EfBciency 


None 


l.lxlO' 


7.8x10' 


140 


140/7.8x10'= 0.02% 


Cre 


7.5x10' 




760 


760/6.1x10'= 0.12% 



Part II: Twenty four colomes fiom the "+ Cre" kanamycin plates were 
picked and inoculated into meditun containing 100 ffg/ml kanamydn. Minipieps 
were done, and the miniprep DNAs, uncut or cut with Smal or Hin&Vi, were 
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electrophoiesed. Results: 19 ofthe 24 minipreps showed supercoiledplastnid 
of the size predicted for the Product plasmid. Ail 19 showed the predicted Smal 
and HinSSSL restriction fragments. Analysis: The Cre only scheme was 
demonstrated. Specifically, it was detomined to have yidded about 70% (19 of 
24) Product clones. The e£Sdaicy was about 0.1% (760 kanamydn resistant 
denes resulted from 6.1x10* ddoran^henicol resistant colonies). 

Ore Plus Itttegrase 

Tlic plasmids used to demonstrate fliis method are exactly analogous to 
those used abovie, excq>t Oat pEZC726, tiie Vector Donor plasmid, contained an 
attPsSt'm place of bsxP 511, and pEZCJQS, tbe Insert Donor plasmid, contained 
an afffi ate in place afloxP 511 (Figure 2A). 

Tliis experiment was conqirised of three parts as fbUows: 
Fart I: About 500 ng of pEZC70S (the Insert Donor plasmid) was cut 
with Seal, vMch linearized die plasmid within the ampicillin resistance gene. 
(This was done because the A, integrase reaction has been historically done wfli 
the affS plasmid in a linear state (H Nash, personal communication). However, 
it was found later that the integrase reaction proceeds well with botii plasmids 
supercoiled.) Then, the linear plasmid was ethanol precipitated and dissolved in 
20 nl of A. integrase bufifer (50 mM Tris-HCl, about pH 7.8, 70 mM KCl, 5 mM 
spennidine-HCl, 0.5 mM EDTA, 250 ng/ml bovine senan albumin). Also, about 
500 ng of the Vector Donor plasmid pEZC726 was ethanol precipitated and 
dissolved in 20 )il A. integrase buffer. Just before use, A integrase (2 nl, 
393 |ig/ml) was thawed and diluted by adding 18 fil cold A. integrase buffer. 
One 111 IHF Ont^ration host factor, 2.4 mg/ml, an accessory protein) was diluted 
into 150 \]1 cold X integrase buffer. Aliquots (2 pi) of each DNA were mixed 
wi& A. integrase bufifer. wi& or wi&out 1 pi each X integrase and IHF, in a total 
oflOpL The mixture was incubated at 25*C for 45 minutes, flien at 70*C for 
10 minutes. Half of eadi reaction was applied to an agarose gel. Results: inibs 
presence of mtegrase and IHF. about 5% ofthe total DNA was converted to a 
linear Cointegiate form. ylRa()tr&.- Activity ofintegrase and IHF was confirmed. 
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Pttrtll: ThreeiiUCToliteisofeaicAreacdon(ic.,v«&or\wlhoiitint^^ 
and IHF) were diluted into 27 fil of Cre buffer (above), then each reaction was 
split into two 10 (il aliquots (four altogether). To two of these reactions, 0.5 jil 
of Cre protein (above) were added, and all reactions were incubated at 37°C for 
30 minutes, then at 70°C for 10 minutes. TE buffer (90 jil; TE: 10 mM Tris-HCl, 
pH 7.5, 1 mM EDTA) was added to each reaction, and 1 jil each was transformed 
into K coli DH5a. The transformation mixtures were plated on 100 (ig/ml 
anq)icillin plus 200 |ig/ml methidllii^ 30 |ig/ml chloramphenicol; or 100 |ig/oiI 
kanamycin. Results: See Table 2. 



table! 


Enzyme 


Ampicillin 


Chloramphenicol 


Kanamycin 


Efficiency 


None 


990 


20000 


4 


4/2x10^=0.02% 


Cre only 


280 


3640 


0 


0 


1 Integrase* 
1 only 


1040 


27000 


9 


9/ 2.7x10* = 0.03% 


1 Integrase* 
1 +Cre 


110 


lUO 


76 


76/1.1x10' = 6.9% 



* Integrase reactions also contained IHF. 



Analysis: The Cre protein impaired transformation. When adjusted for 
this effect, tbe nuiiiber of kanamydn resbtant colonies, compared to the control 
leaictioiis, increased more tiian 1 00 fold when bodi Cre and Integrase were used. 
This su^ests a spedficity of greater than 99%. 

Part HI: 38 colonies were picked from the Integrase plus Cre plates, 
miniprep DNAs were made and cut with HinSSl to give diagnostic nuq>ping 
information. Result: All 38 had precisely the expected fiagment sizes. 
Analysis: The Cre plus A, integrase method was observed to have mudi higher 
spedficity than Cre-alone. Conclusion: The Cre plus A, integrase method was 
demonstrated. EfSdency and specificity were much higher than for Cre only. 
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Exanqik 2: Using in vitro Eeettmbinational Qoning to Subclone the 
Chloraa^henieol Acetyl Tmusferase Gate into a Vector for Ej^ression in 
Eukaryo&: Cells (rtgure4A) 

An Insert Donor plasmid, pEZC843, was constiucted, conqnising the 
diloramphaiicol acetyl transferase gore of £ coli, cloned between kxP and ottB 
sites such tiiat the loxP site was positioned at t}ie S'-end of the gene (Figure 4B). 
A Vector Donor plasmid, pEZC1003, was constructed, which contamed flie 
cytomegalovirus eukaryotic promoter apposed to a loxP site (Figure 4C). One 
microliter aliquots of eadi supercoiled plasmid (about 50 ng crade miniprep 
DNA) were combined in a ten microliter reaction containing equal parts of 
lambda integrase buffer (50 mM Tris-HCl, pH 7.8, 70 mM KCI, 5 mM 
spermidine, 0.5 mM EDTA, 0.25 mg/ml bovine serum albumin) and Cre 
recombinase buffer (50 mM Tris-HCl, pH 7.5, 33 mM NaCl, 5 mM spermidine, 
0.S mg/ml bovine setxim albumin), two units of Cie recombinase, 16 ng 
integradon host &ctor, and 32 ng lambda integcase. After incubation at 30°C for 
30 minutes and 7S*C for 10 minutes, one microliter was transformed into 
competent £ coli sttiEun DH5a (Ufe Tedmologies, Inc.). Aliquots of 
transformations were spread on agar plates containing 200 pg/ml kaitamycin and 
incubated at 37*C overnight An otherwise identical control reaction contained 
flie Vector Donor plasmid only. He plate receiving 10% of the control reaction 
transfonnation gave one colony, the plate receiving 10% of the recombinational 
clomng reaction gave 144 colonies. These numbers suggested that greater than 
99% of the recombinational cloning colonies contained the desired product 
plasmid. Miniprep DNA made from sk recombinational cloning colonies gave 
the predicted size plasmid (5026 base pairs), CMVProd. Restriction digestion 
with ATcoI gave the fri^ents predicted for die diloranqihenicol acetyl transferase 
cloned downstream of the CMV promote for all six plaanids. 
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Example 3: Subdoned DNA Se^naOs Flanked by attB Sites Without Stop 
Codons 

Parti: Bael^ound 

Hie above examples are suitable for transcriptional fusions, in vMdi 
tiaosoiption raosses lecombination sites. However, both atffi. and loxP sites 
contain multiple stop codons on both strands, so taanslational fiisions can be 
di£Bcuh, vihae tfae axfing sequence must cross the recomUnation sites, (only one 
reading fiame is available on eadi strand of loxP ates) or impossible (in attR or 
attL). 

A prindpal reason for subcloning is to fiise protein domains. For 
example, fiiaon of the glutathione S-tiansferase (GST) domain to a protein of 
interest allows the fusion protein to be purified by afOnity chromatography on 
glutathione agarose (Pharmacia, Inc., 1995 catalog). If the protein of interest is 
fused to luns of consecutive histidines (for example His6), the fusion protein can 
be purified by afBnity chromatography on chelating resins containing metal ions 
(Qiagen, Inc.). It is often desirable to compare amino terminal and carboxy 
terminal fusions for activity, solubility, stability, and the like. 

The attB sites of the bacteriophage X integration system were examined 
as an alternative to loxP sites, because they are small (25 bp) and have some 
sequence flexibUity (Nash, RA. etaL. Proa Natl Acad. ScL USA 84:4049^053 
(1987). It was not previously suggested that multiple mutations to remove all 
stop codes would result in usefiil recombination sites for recombinational 
subcloning. 

Using standard nomenclature for site specific recombination in lambda 
bacteriophage (Weisber, in Lambda III, Hendrix, et td., eds.. Cold Spring Harbor 
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Laboratoty, Cold Spring Haibor, NY (1989)), flie nucleotide r^ons Oat 
participate in die recombinatioaieaction in an £ co/i host cell are represented 

as follows: 

attP --P1— HI— P2— X— H2— C-O-C— H'--P'l--P'2— P'3 — 

attB —B-O-B'" 

Int, IHF 11 Xis, Int. IHP 

attR — PI— m— P2— X— H2--C-0-B' — 

attli — B-O-C— H'— P'l— P'2— P'3--, 

wliere: O represents the 15 bp core DNA sequence found in both the phage and 
E. coli genomes; B and B' represent approximately 5 bases adjacent to the core 
in the K coli genome; and PI, HI, P2, X, H2, C. C, H. Fl, P'2. and F3 represent 
known DNA sequences encoding protein binding domains in the bacteriophage 
A, genome. 

The reaction k reversible m die presence of the protein Xis (excisionase); 
recombination between attL and attR precisely excise the A. genome fiom its 
int^rated state, regenerating the circular A. genome containmg attP and the linear 
K coli genome containing attB. 

Part 11: Construction and Testing ofPlasmids Containing Mutant att Sites 

Mutant attL and attR sites were constructed. Importandy, Landy et al. 
{Ann. Rev. Biochem. 58:913 (1989)) observed tiiat deletion of die PI and HI 
domains of attP fecilitated the raccision reaction and eliminated the int^ration 
reaction, diereby maldng die excision reaction irreversible. Therefore, as 
mutations were introduced in attR, die PI and HI domams were also deleted. 
attR sites in die present example lack die PI and HI regions and have die Ndel 
site removed (base 27630 changed fiom C to G), and contain sequences 
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coiresponding to bactoriophage A. cooidinates 27619-27738 (GenBank release 
92.0, bg:LAMCG, "Complete Sequence of Bacteriophage Lambda"). 

The sequence of attB produced by recombination of wild type attL and 
attRsitesis: 



B O B' 

attBwt: 5' AGCCT GCTTTTTXftrAC3a4 CTJ5A 3' (SEQ. ID 

llO:31) 

3' TOGGA OGftA& AAAm Tgarr GAACT 5' ■ 

The stopcodons are italicized and undolined. Note Haat sequoices of attL,.attR, 
and attP can be derived from the attB sequence and the boundaries of 
bactoiophage A. contained within attL and attR (coordinates 27619 to 27818). 

When mutant atlRl and attLl sites were lecombined the sequence attBl 
was produced (mutations in bold, large font): 



5' ■^^^^ GCTTTTTtGtACAaA CTTgT 3 <SEQ. ID NO:6) 
3< TCGGA cGAAAAAaCaTgTtT GAAcA 5' 



Note that the four stop codons are gone. 

When an additional mutation was introduced m the attRl and attLl 
sequences (bold), attR2 and stUJl sites resulted. Recombination of attR2 and 
attL2 produced die attB2 site: 

B -0 B' 

attB2: S- MCCT gCTTtCtTOTACAAA CTTGT 3' (SEQ. ID NO = 7) 

3' cxsaaaGaacatgttt 5' 



The recombination activities of the above attL and attR sites wereassayed 
as follows. TliB attB site of plasmid pEZC70S (Figure 2B) was replaced witii 
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attLwt, attLl, or attL2. The attP site of plasnud pEZC726 (Figure 2C) was 
replaced with attRwt (lacking legions PI and HIX attRl, or attR2. Thus, the 
resulting plasmids could lecpmbine via Hbaii loxP sites, mediated by Cre, and via 
their attR and attL sites, mediated by Int,Xis. and IHF. Pabsof plasmids were • 
5 mixed and reacted wi&(>e,Int,^ and IHF,ttansfoniied into £.co/i competent 

cells, and plated on agar containing kanamycia The results are presented in 
Tables: 



Tables 



Vector donor att site 


Gene donor att site 


# of Icanamycin resistant colonies* 


ata(.wt(pEZC130I) 


None 


1 (badcground) 




atlLwt(pEZC1313) 


147 




attLl (pEZC1317) 


47 




attL2(pEZC1321) 


0 


aim (pEZClSOS) 


None 


I (Jaadkffomd) 




attLwt(pEZC1313) 


4 




attLl (pEZC1317) 


128 




attL2(pEZC1321) 


0 


attR2(pEZC1309) 


None 


9 (background) 




attLvrt(pEZCl313) 


0 




attL2{pEZC1317) 


0 




attLZ(pEZC1321) 


209 



(* 1% of each transformation was spread on a kanamycin plate.) 



The above data show that «4iereas the wild type att and atti sites 
recombine to a small extent, the attl and att2 sites do not Tecombine detectably 

25 with each other. 

Part HI. Recombination was demonstrated when the core region of both 
attB sites flanking the DNA segment of interest did not contain stop cbdons. The 
physical state of the participating plasmids was discovered to influence 
recombination efficiency. 

30 TTie appropriate att sites were moved into pEZC705 and pEZC726 to 

make the plasmids pEZC1405 (Figure 5G) (attRl and attR2) and pEZC1502 
(Figure 5TX) (attt.1 and attL2). The desired DNA segment in tUs experiment was 
a copy of the chloramphenicol resistance gene cloned between the two attL sites 
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of pEZC1502. Pairs of plasmids were lecombined in vitro using Int, Xis, and 
IHF (no Cre because no loxP sites were {nesmt). The yield of desired kanamydn 
resistant colonies was detennined viaa both parental plasmids were circular, or 
when one plasmid was circular and llie otiier linear as presented in Table 4: 



Vector donor* 


Goie donor* 


Kanamycin resistant colonies' 


Circular pEZCI40S 


None 


30 


Circular pEZC140S 


Circular pEZClS02 


2680 


Linear pEZC140S 


None 


90 


Linear pEZC140S 


Circular pEZC1502 


172000 


Circular pE2X:i40S 


Linear pEZClS02 


73000 



' DNAs were purified with Qiagen columns, concentrations detennined by 
A260, and linearized with Xba I (pEZC1405) or AlwN I (pEZC1502). Each 
reaction contained 100 ng of the indicated DNA. All reactions (10 ji! total) 
15 contained 3 (li of enzyme mix (Xis, Int, and IHF)- After incubation (4S minutes 

at 25°, 10 minutes at 65 °), one jil was used to transform K coli DHSa cells. 
^ Number of colonies expected if the entire transfonnation reaction (1 ml) had been 
plated. Either 100 |tt or i pi of the transfonnations were actually plated. 

Anafysis: Recombinadonal cloning using mutant attR and attL sites was 
20 confirmed. The desired DNA segmfflit is subclonedbetweoiattB sites that do not 

contain any stop codons in eilber strand. The enhanced ^eld of Product DNA 
(wfaoi one parrait was linear) was une2q)ected because of earlier observations diat 
&e exdsion reaction was more efficient vibssa both participating molecules were 
supetcoiled and proteins were limiting (Nunes-Duby et aL, Cell SOillS-lM 
25 (1987). 



Example 4: Demonstration of Recombmational Cloning Without Inverted 
Repeats 

Parti: Rationale 

The above Example 3 showed that plasmids containing inverted repeats 
30 of the appropriate recombination sites (for example, attLl and attL2 in plasnud 

pEZClS02) (Figure SH) could recombine to give the desired DNA segment 
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flanksd by attB sites vvithout stop codons, also in inverted orientation. A concern 
was the in vivo and in vitro influence of the inverted repeats. For example, 
transcription of a desired DNA segment flanked by attB sites in inverted 
orientation could yield a single stranded RNA molecole ttat mig^ form a hsiipin 
structure, thereby iiihibiting translation. 

Inverted orientation of amilar recombination sites can be avoided by 
pladng file ates in direct repeat aiianganentatt sites. Ifpatentalplasmidseach 
have a vriM Qrpe atO. and wild type attR site, in (Urect repeat die ^ 
ptotdns will simply remove dte DNA segment flanked by those sites in an 
intramoieoilar reaction However, the mutant sites described in fte above 
Ex^ple 3 suggested tiiat it might be possible to inihibit the intramolecular 
reaction «^ aUowing the intermoleculariecombmation to proceed as desired. 

Partn: Structiavof Plasnudsfrohout Inverted Rqteais for Reconaination^ 
Gohing 

■nieattR2sequaiceinplasmidpEZC1405 (Figure 5G) was rq)laced with 
attL2, in tiie opposite orientation, to make pEZC1603 (Figure 6A). The attL2 
sequence of pEZC1502 (Figure 5H) was replaced with attR2, in the opposite 
orientation, to make pEZC1706 (Figure 6B). Each of these plasnuds contained 
mutations in tiie cote region that make intramolecular reactions b^een attl and 
att2 cores veiy inefficient (see Example 3, above). 

Plasmids pEZC1405, pEZC1502. pEZC1603 and pEZC1706 were 
purified on Qiagen columns (Qiagen, Inc.). Aliquots of plasmids pEZC1405 and 
pEZC1603 were linearized witii Xba I. Aliquots of plasmids pEZC1502 and 
pEZC1706 were linearized witii AlwN I. One hundred ng of plasnuds were 
mixed m buflfer (equal volumes of 50 mM Tris HCl pH 7.5, 25 mM Tris HCl pH 
8.0, 70 mM KCl, 5 mM spermidine, 0.5 mM EDTA, 250iig/ml BSA, 10% 
glycerol) containing Lit (43.5 ng% 7Gs (43 ng) and IHF (8.1 ng) in a final volume 
of 10 nl. Reactions were incubated for 45 minutes at 25"'C, 10 minutes at 65°C, 
and 1 jd was transformed into K coli DH5o. Afier ejqpression, aliquots were 
spread on agar plates containing 200 ng/ml kanamycin and uicubated at 37''C. 
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Results, oquessed ais the number of colonies per I |il of lecombination 
reaction are presented in Table S: 



Tables 



Vector Donor 


Gene Donor 


Colonies 


Predicted % product 


Circular 140S 




100 




Circular 1405 


Circular 1502 


3740 


3640/3740 = 97% 


Linear 1405 




90 




Linear 1405 


Circular 1502 


172.000 


171.910/172,000 = 99.9% 


Circular 1465 


Linear 1502 


73.000 


72,900/73.000 = 99.9% 










Circular 1603 




80 




Circular 1603 


Circular 1706 


410 


330/410 = 80% 


Linear 1603 




270 




Linear 1603 


Circular 1706 


7000 


6730/7000 = 96% 


Circular 1603 


UnearI706 


10,800 


10,530/10,800 = 97% 



Analysis. In all configurations, i.e., circular or linear, the pEZC140S x 
pEZC 1 502 pair (with att sites in inverted repeat configuration) was more efficient 
than pEZC1603 x pEZC1706 pair (with att ates mutated to avoid haiipin 
formation). Hie p£ZC1603 x pE2X:i706 pair gave higher backgrounds and 
lower efficiencies than the pEZC1405 x pEZC1502 pair. While less effidoit, 
80% or mote of tiie colonies fiom tiie pEZC1603 x pEZC1706 reactions were 
expected to contain the desired plasmid product. Making one partner linear 
stimulated the reactions in all cases. 

Part in: ConJinnaHon of Product Plasndds' Structure 

Six colonies each from the linear pEZC1405 (Figure 5G) x circular 
pEZC1502 (Figure 5H), circular pEZC1405 x linear pEZC1502. linear 
pEZC1603 (Figure 6A) x circular pEZC1706 (Figure 6B), and circular 
pEZC1603 x linear pEZC1706 reactions were picked into rich medium and 
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ininqnrep DNAs were prepared. Diagnostic cuts with Ssp I gave the predicted 
liestriction fragments for all 24 colonies. 

Analysis. Recombination reactions betweeaplastnids with mutant attL 
and attR sites on die same molecules gave the deared plasniid products widi a 
high d^ree of spedfidty. 

ExanpJe 5: Reeombinational Gotung with a Toxic Gene 
Parti: Background 

Restriction enzyme Dpn I recognizes tiie sequence GATC and cuts fliat 
sequence only if die A is mediylated by die dam mell^lase. Mostcommonfy 
used E coli strains are dam^ E]q)ression of Dpn I in dam^ strains of £ colt is 
ledial because die diromosome of the cell is chopped into mai^ pieces. 
However, in darn' cells expresaon of Dpn I is innocuous because the 
chromosome is immune to Dpn I cutting. 

In the general reeombinational cloning scheme, in which Ihe vector donor 
contains two segments C and D separated by recombination sites, selection for 
the desired product depends upon selection for the presence of segment D, and 
the absence of segment C. In the original Example segment D contained a drug 
resistance gene (Km) that was negatively controlled by a repressor gene found on 
segment C. When C was present, celk containing D were not resistant to 
kanamycm because the resistance gene was turned ofif. 

The Dpn I gene is an example of a tOHC gene that can replace the 
rq)ressor gene of die above enibodiment If s^ment C cjqwfcsses die Dpn I gene 
product, transforming plasmid CD into a dam* host kills die cell. If segment D 
is transferred to a new plasmid, for example by reeombinational clomng, dien 
selecting for the drug marker will be successful because (he tone gene is no 
longer present 
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Part H: ConstructioH of a Vector Donor Using Dpn I as a Toxic Gene 

The gene encoding E>pn I endonuclease was amplified by PGR using 
primers 5'CCA CCA CAA ACG CGT CCA TGG AAT TAG ACT TTA ATT 
TAGS' (SEQ. ID NO: 17) and 5'CCA CCA CAA GTC GAC GCA TGC CGA 
CAG CCT TCC AAA TGT3' (SEQ. ID NO: 1 8) and a plasmid cont^ning the Dpn 
I gene (derived fix>m plasmids obtained fiom Sanfoid A. Lades, Brookfaaven 
National Laboratory, Upton, New Yoil^ also available fiom American T^pe 
Culture Collection as ATCC 67494) as tbs teaaplaSc. 

Additional mutations were introduced into the B and B' regions of attL 
and attR, respectively, by amplifying ensting atfL and atfR dotnains vritfa primers 
contaimng the deaied base dianges. Recombination of tiie mutant ata.3 (made 
with oligo ^115) and attR3 (made with oligo Xsl 12) yielded attB3 vritfa flie 
following sequence (difGsrences fiom attBl in bold): 

B OB' 
15 ACCCA. GKrrTTCTTGXACAIA GKaGT (SEQ. ID 110:6] 

tGggT cgaaaGaacatgttt OvCca. 

The attL3 sequence was cloned in place of attL2 of an existmg Gene Donor 
plasmid to give the plasmid pEZC2901 (Figure 7 A). The attR3 sequence was 
cloned in place of attR2 in an existing Vector Donor plasmid to give plasmid 

20 pEZC2913 (Figure 7B) Dpn 1 gene was cloned into plasmid pEZC2913 to replace 

the tet repressor gene. The resulting Vector Donor plasmid was named 
pEZCSlOl (Figure 7C). When pEZCSlOl was transformed into the dam" stram 
SCSUO (Stratagene), hundreds of colonies resulted. When die same plasmid was 
transformed into tiie daoH- strain DH5a, only one colony was produced, even 

25 though the DH5a cells were about 20 fold more competent than the SCSI 10 

cells. When a related plasmid that cUd not contain the Dpn I gene was 
transformed into die same two cell lines, 28 colonies were produced bom the 
SCSI 10 cells, ■vAalc 448 colonies resulted fiom the DH5a cells. This is evidence 



5 



10 
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that tbc Dpn I gene is being esqnessed on plasmid pEZCSlOl (Figure 7C), and 
diat it is killii^ die dam* DHSa cells but not Ifae dam- SCSI 10 cells. 

Part IE: DemoiuOtOwnofRecombinathnaiaoning Using Dpn I Selection 

A pair of plasmids was used to danonsbate recombinadonal cloning with 
sdection for product d^dent vpon tbs toxic gene Dpn I. Plasmid pEZCSlOl 
figure 7C) was l inear i zed vrifli Mlu I and reacted with circular plasmid 
pEZC2901 (Figure 7A). A second pair of plasmids uang selection based on 
control of drug resistance by a repressor gene was used as'a control: plasmid 
pEZC1802 i^igure 7D) was linearized with Xba I and reacted with drcular 
plasmid pEZC15Q2 figure SH). Ei^ nucroliter'reactions containing die same 
bu£fer and protdns IGs, Int, and IHF as in ^evious examples were incubated for 
45 niinntes at 25°Q dien 10 niinutes at TS'C, and 1 aliquots woe transformed 
into DH5a (Le., dam4-) competent cells, as presented in Table 6. 



Table 6 



Reaction # 


Vector donor 


Basis of selection 


Gene donor 


Colonies 


1 


pEZC3101/Mlu 


Dpn I toxicity 




3 . 


2 


pEZC310l/Mlu 


Dpn I toxicity 


Circular pEZC2901 


4000 


3 


pEZC1802«ba 


Tet fiqtressor 




0 


4 


pEZC1802«ba 


Tetrquessor 


Circular pEZClSQ2 


12100 



Miniprep DNAs Were prepared fiom four colonies fiom reaction 1t2, and 
cut with restriction en^meSsp I. All gave the predicted fragments. 

Analysis: Subclonmg using selection with -a toxic gene was 
demonstrated. Plasmids ofthe predicted structure were produced. .. 
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Exmq>le 6: Goning of Genes mth UracU DN4 Glycosylase and Subcloniug 
of the Genes wiOt Recombinaeonal Ootdng to Make Fusion Protons 

Part I: Converting an Existing Expression Vector to a Vector Donor for 
ReconAinaOonal Cloning 

5 A cassette useful for converting existing vectors into functional Vector 

Donors was made as follows. PlasmidpEZC3101 (Figure 7C) was digested widi 
Apa I and Kpn I, U^ated widi T4 DNA polymerase and dNTPs to teada tlie aids 
blunt, fiuther digested vnth Sma I, Hpa I, and AlwN I to render the undesirable 
DNA fiagments small, and the 2.6 kb cassette containing the attRl - Cm*^ - Dpn I 

10 - attR.-3 domains was gel purified. The concentration of the purified cassette was 

estimated to be about 75 ng DNA/(iI. 

Plasmid pGEX-2TK OFiguie 8A) (Pbamiada) allows fiidons between die 
piofran glutafliione S transferase and any second coding sequence that can be 
inserted in its multiple cloning site. pGEX-2TK DNA was digested widi Sma I 

15 and treated with alkaline phosphatase. About 75 ng of the above purified DNA 

cassette was ligated •mOi about 100 ng of die pGEX-2TK vector for 2.S hours in 
a 5 )d ligation, then 1 111 was transformed into competent BRL 3056 cells (a dam- 
dsivative of DHIOB; dam' strains commercially available include DM! finm 
Life Tedmologies, Inc., and SCS 110 finm Stratagene). Aliquots of the 

20 transfdimation mbcture were plated on LB agar contiuning 1 00 (ig/ml ampicillin 

(resistance gene present on pGEX-2TK) and 30 (xg/ml chloramphoiicol 
(resistance gene present on the DNA cassette). Colonies were picked and 
miniprep DNAs were made. The orientation of the cassette in pGEX-2TK was 
d^ennined by diagnostic cuts with EcoR I. A plasmid widi the desired 

25 orientation was named pEZC3501 (Figure 8B). 
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Part 11: CUtning Reporter Genes Into an Recombinational Cloning Gene 
Donor Plasmid in Three Reading Frames 



Uradl DNA ^ycosylase (UDG) cloning is a method for cloning PCR 
anqilificiation products into cloning vectors (U.S. patent No. 5^34,515, entirely 
incorporated herein by reference). Briefly, PCR amplification of tiie desired 
DNA segment is performed with primers that contain uracil bases in place of 
Aymidine bases in flieir 5' ends. When such PCR products arc incubated wifli the 
enzyme UDG, the uracil bases are specifically removed. The loss of these bases 
■weakens base pairing in die ojds of flie PCR product DNA, and v*en incubated 
at a suitable temperature (e.g., ST'C), the ends of sudi products are largely smgle 
stranded. If such incubations are done in the presence of linear cloning vectors 
containing protruding 3' tails that are complementary to the 3' ends of the PCR 
products, base pairing efiSdently anneals the PCR products to the cloning vector. 
When the annealed product is introduced into E. coli cells by transformation, in 
vivo processes efSciently convert it into a recombinant plasmid. 

UDG cloning vectors that enable cloning of any PCR product in all three 
reading fiames were prepared fiom pEZC3201 (Figure 8K) as follows. Eight 
oligonudeotides were obtsuned fiwm Life Technologies, Inc. (all written 5' - 3': 
ril top (GGCC GAT TAC GAT ATC CCA ACG ACC GAA AAC CTC TAT 
TTT CAG GGT) (SEQ. ID N0:19), rfl bottom (CAG GTT TTC GGT CGT 
TGG GAT ATC GTA ATCXSEQ. ID NO:20), rf2 top (GGCCA GAT TAC 
GAT ATC CCA ACG ACC GAA AAC CTG TAT TTT CAG GGT)(SEQ. ID 
N0:21), rt? bottom (CAG GTT TTC GGT CGT TGG GAT ATC GTA 
ATC TXSEQ. ID NO:22), rB top (GGCCAA GAT TAC GAT ATC CCA ACG 
ACC GAA AAC CTG TAT TTT CAG GGTKSEQ. ID NO:23), rC bottom 
(CAG GTT TTC GGT CGT TGG GAT ATC GTA ATC TT)(SEQ. ID NO:24), 
carbojg' top (ACC GTT TAC GTG GAC)(SEQ. ID NO:25) and carboxy 
bottom (TCGA GTC CAC GTA AAC GGT TCC CAC TTA TTAXSEQ. ID 
NO:26). The rfl, 2, and 3 top strands and the carboxy bottom strand were 
phosphorylated on their 5' ends wifli T4 polynucleotide kinase, and then the 
complementary strands of each pair were hybridized. Plasmid pEZC3201 
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(Figure 8K) was cut vrith Not I and Sal I. and aliquots of cut plasmid were mixed 
with the caiboxy-oligo duplex (Sal 1 end) and either the rfl, r£2. or rf3 duplexes 
(Not I ends) (10 jig cut plasmid (about 5 pmol) mixed with 250 pmol carboxy 
oligo duplex, split into three 20 (il volumes, added 5 jil (250 pmol) of rfl , rf2, or 
5 rf3 duplex and 2 jil = 2 units T4 DNA ligase to each reaction). After 90 minutes 

of ligation at room temperature, each reaction was applied to a preparative 
aigarose gel and die 2.1 kb vector bands were eluted and dissolved in 50 fU of TE. 

Part lU: PCR ofCATandphoA Genes 

Primers were obtained fiom Life Technologies, Inc.. to amplify the 

id diloiamphenicol acetyl transferase (CAT) gene fixim plasmid pACYCl 84, and 

phoA, the alkaUne phosphatase gene fiom £ coli. The primers had 12-base 5' 
extoiaons containing uracil bases, so tiuit treatment of PCR products widi uracil 
DNA glycosylase (UDG) would weaken base pairing at each end of the DNAs 
and allow die 3' strands to anneal with the protruding 3' ends of the rfl, 2, and 3 

15 vectors described above. The sequences of tiie primers (all written 5' - 3*) 

were: CAT left, UAU UUU CAG GGU ATG GAG AAA AAA ATC ACT GGA 
TAT ACC (SEQ. ID NO:27); CAT right, UCC CAC UUA UUA CGC CCC 
GCC CTG CCA CTC ATC (SEQ. ID NO:28); phoA left, UAU UUU CAG GGU 
ATG CCT GTT CTG GAA AAC CGG (SEQ. ID NO:29); and phoA right, UCC 

20 CAC UUA UUA TTT CAG CCC CAG GGC GGC TTT C (SEQ. ID NO:30). 

The primers were then used for PCR reactions using known method steps {see, 
e.g., U.S. patent No. 5,334,515, entirely incorporated herein by reference), and 
the polymerase chain reaction amplification products obtained with these primers 
comprised the CAT or phoA genes with the initiating ATGs but without any 

25 transoiptional signals. In addition, tte uracil-containing sequences on the amino 

tennini encoded flje cleavage site for TEV protease (Life Technologies, Inc.), and 
those on the caiboxy terminal encoded consecutive TAA nonsense codons. 

Unpurified PCR products (about 30 ng) were nuxed with the gel purified, 
Unear ifl, r£2, or rf3 cloning vectors (about 50 ng) in a 10 ^l reaction contaming 

30 IX REact 4 buffer (LTI) and 1 unit UDG (LTI). After 30 minutes at yTC, 1 jil 
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aliquots of each reaction were transfonned into competent E. coU DH5a ccUs 
(LIT) and plated on agar containing 50 fig/ml kanamycin. Colonies were picked 
and aoalyas of miniprep DNA showed that the CAT gene had been cloned in 
reading fiame 1 (pEZC3601XFigure 8Q. reading frame 2 (pEZC3609)(Figurc 
8D) and reading fiame 3 ft)EZC3617XFigurc 8E), and that the phoA gene had 
been cloned in reading fiame 1 0>EZC3606)(Figiire 8F), reading fiame 2 
(pEZC3613XFigure 8G) and reading fiame 3 ^EZC3621XFigure 8H). 

Part IV: SabdoningofCATorphoAfivm VDG Ooiung Vectdn iiito a GST 
Fusion Vector 

Plasmids encoding fiisions between GST and either CAT or phoA in all 
fliree reading frames were constructed by recombinationa] cloning as follows. 
Miniprep DNA of GST vector donor pEZC350I(Figure 8B) (derived from 
Pharmacia plasmid pGEX-2TK as described above) was linearized with Cla I. 
About 5 ng of vector donor were mixed with about 1 0 ng each of the q)propriate 
dreuiar gene dcmor vectors containir^ CAT or phoA in 8 jU reactions containing 
buffer and recombination protems Int. Xis, and IHF (above). After incubation, 
1 |xl of each reaction was transfonned into £ coli strain DH5o and plated on 
■ ampicillln, as presented in Table 7. 



Tabu 7 



20 



25 



DNA 


Colonies (10% of each transfonnation) 


Linear vector donor (pEZC3S01/CIa) 


0 


Vector donor + CAT ffl . 


110 


Vector donor + CAT rf2 


71 


Vector donor + CAT rfi 


148 


Vector donor + phoA rfl 


121 


Vector donor + phoA rf2 


128 


Vector donor + phoA rf3 


31 
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Part V: Expression of Fusion Proteins 
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Two colonies from each tiansfonnadon were picked into 2 ml of rich 
medium (CircleGiow, BiolOl Inc.) in 17 x 100 mm plastic tubes (Falcon 2059, 
Becton Dickinsoa) containing 100 (ig/ml ampicillin and shaken vigorously for 

5 about4hoursat37''C, at which time the cultures were visibly turbid. One ml of 

each culture was transferred to a new tube containing 10 |il of 10% (w/v) IPTG 
to induce expression of GST. After 2 hours additional incubation, all cultures had 
about the same turbidity; the A600 of one culture was 1.5. Cells from 0.35 ml 
each culture were harvested and treated widi sample buffer (containing SDS and 

10 P-mercq>toethanol) and aliquots equivalent to about 0.15 A600 units of cells 

were applied to a Novex 4-20% gradient polyaoylamide gel. Following 
elec&ophoresis the gel was stained witili Coomassie blue. 

Results: Enhanced eiqirKsion of angle protdn bands was seen for alll2 
cultures. The observed saes of diese proteins correlated well with the sizes 

15 predicted for GST bemg fused (through attB recombination sites witiiout stop 

codons) to CAT or phoA in three reading fiames: CAT rfl = 269 amino acids; 
CAT r£2 = 303 ammo acids; CAT r£3 = 478 ammo acids; phoA rfl = 282 ammo 
acids; phoA i£2 = 280 ammo acids; and phoA rf3 = 705 anuno acids. 

Analysis: Both CAT and phoA genes were subcloned into a GST fiision 

20 vector in all three reading fiames, and e]q>ression of the six flision proteins was 

demonstrated. 

While the foregoing invention has been described in some detail for 
puqwses of clarity and understanding, it will be appreciated by one skilled in the 
art from a reading of this disclosure that various changes in form and detail can 
25 be made widiout depaxtii^ from the tme scope of the invention and appended 

claims. All patents and publications cited hoem are entirely incorporated herein 
by reference. 
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Whatls Claimed is: 

1. AVectorDonorDNAmoleculecoinprifflngafirstDNAsegment 
and a second DNA segment, said first or second DNA segfueat containing at least 
one Selectable marker, wiierem the first and second segments are sqjaiated dther 
by, (i) in a circular Vector Donor, a fiast and a second recombination site, or 
Cu) in a linear Vector Donor, at least a first recomWnation ate, w*erein eadi pair 
of flanking recombination sites are engineered and do not lecombine wfli each 
other. 

2. A Vector Donor DNA molecule according to clam 1, wbsKm Ha 
Selectable maricer is alleast one DNA se^ent selected fiomfliegioiq) consisting 
of: 

0) a DNA segment that encodes a product that provides 
resistance against odierwise toxic compounds; 

01) a DNA segment that encodes a product fliat is otherwise 
lacldng ui die recipient cell; 

(iii) a DNA segment that encodes a product that suppresses tire 

activity of a gene product; 
Ov) a DNA segment tiiat encodes a product tiiat can be readily 

identified; 

(v) a DNA segment fliat encodes a product fliat is detrimental 
to cell survival and/or fimction; 

(vi) a DNA segment that inhibits tiie activity of any of tire 
DNA segments of (i)-(v) above ; 

(vii) a DNA segment fliat binds a product tfiat modifies a 
substrate; 

(viii) a DNA segment tiiat provides for isolation of a desired 
molecule; and 

(ix) a DNA segment fliat encodes a specific nucleotide 
sequence v*ich can be oflierwise non-fimctional; and 
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(x) a DNAsegmCTt that, whra absent, directly or indirectly 
confers sensitivity to particular conqmunds. 



3. A Vector Donor DNA according to claim 2, wherein said 
Selectable- marker is at least one selected from the group consistii^ of an 

5 antibiotic resistance gene, a tRNA gene, an auxotrophic marker, a toxic gene, a 

phenotypic marker, an antisense oligonucleotide; a restriction endonuclease; a 
restriction endonuclease cleavage site, an enzyme cleavage site, a proton binding 
site; and a sequence complementary PGR primer. 

4. AVectorDonorDNAmoleculeaccordingtoclaiml,\vlierdnsaid 
10 Selectable marker comprises at least one inactive fii^Mnt of a Selectable 

marker, vdierein the inactive fiagment is capable of leconstitutiDg a fimctional 
Selectable marker vibea recombined across said first or second lecombination ate 
mtfa a fijitiier DNA segment comprisir^ another inactive fiagment of the 
Selectable marker. 



S. An Insert Donor DNA molecule, comprising a desired DNA 
segment flanked by a first recombination site and a second recombination site, 
wherein the first and second recombination sites are engineered and do not 
recombine witii eadi other. 



6. An Insert Donor DNA molecule according to claim 5, wherein 
20 said desired DNA segment codes for at least one selected fix)m the group 

consisting of a cloning site, a restriction site, a promoter, an'operon, an origin of 
replication, a fiinctional DNA, an antisense RNA, a PGR fiagment,-a protein or 
a protein firagment 

7. A kit conqirising a container being compartmentalized to receive 
25 in close confinement therein at one compartment, wherein a first compartment 

contains a Vector Donor DNA molecule comprising a first DNA segment and a 
second DNA segmoit, said first or second DNA segment containing at least one 
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Selectable maiker, indierein die first and second segments are flanked either by, 
(i) in a circular Vector Donor, a first and a second recombination site, or (ii) m 
a linear Vector Donor, a first recombination site, \(4ieiein each pair of flanking 
recombination sites are engineered and do not recombine vndi each other. 

8. A kit according to claim 7, further conqnising a second 
conq)artmait containing an Insot Donor DNA molecule conq)iising a desired 
DNA s^ment flaiiked by a first recombination site and a second recombination 
ate, vt4ierein the first and second recombination sites are engineeied and do not 
recombine with each other. 

9. A kit according to chum 7, fiudier comprising an additional 
compartment containmg at least one recombination protem ci^le of 
recombming a DNA segment comprising at least one of said recombination ates. 

10. A nuddc acid molecule, comprising at least one DNA segment 
having at least two recombmation sites flanking a Selectable marker or a desired 
DNA segment, wherein at least one of said recombination sites comprises a core 
region having at least one engineered mutation that enhances recombmation in 
vitro in the formation of a Cointegrate DNA or a Product DNA. 

11. A nucleic acid molecule according to claim 10, viicrem said 
mutation confers at least one enhancement of said recombination, said 
enhancement selected fmm the gcoxtp consisting of substantially (i) favoring 
excisive recombinatioi^ Qi) &voring integrative recombination; (iii) relieving the 
requirement for host fectors; (iv) increasing the efficiency of said Cointegrate 
DNA or Product DNA formation;(v) inaeasmg the spedficitjr of said Cointegrate 
DNA or Product DNA formation; and contributes desirable attributes to the 
Product DNA. 
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12. A nucleic add molecule aocoiding to clam 10, wherein said 
lecombination site is derived fiom at least one lecombination site selected fixmi 
fbs group consisting of attB, attP, attL and atlR. 

13. A nucleic acid molecule according to claim 12, wdieiein said att 
ate is selected from the giotq>s consisting of attl, at£2 and att3. 

14. A nucleic acid accoiding to daim 10, wherein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) RKYCWGCnTYOTR.TACNAASTSGB(m-att)(SEQIDN0:l); 

(b) AG<XWGCTTIYKTRTACNAACTSGB (m-attB) (SEQ ID 

N0:2); 

(c) GTTCAGCnTCkTRTACNAACTSGB (m-attR) (SEQ ID 

N0:3); 

(<0 AGCCWGCTTTCKTRTACNAAGTSGB (m-attL) (SEQ ID 

N0:4); 

(e) GTTCAGCnTyKTRTACNAAGTSGB(m-attPl) (SEQ ID 

N0:5); 

and a corresponding or cpmplemenlary DNA or RNA sequence, whaein R=A or 
G; K=G or T/U; Y=C or T/U; W=A or TAJ; N=A or C or G or TAJ; S=Cor G; 
andB=CorGorTAJ. 



15. A nucleic acid according to claim 14, vdierein said core region 
comprises a DNA sequence selected from the group consisting of: 

(a) AGCCTGCTTTTTTGTACAAACTTGT(attBl)(SEQlDN0:6); 

(b) AGCCTGCTlTCTTGTACAAACTTGT(atlB2) (SEQ 1DN0:7); 

(c) ACXXIAGCnTCTrcTACAAACTFGT (attB3) (SEQ ID N0:8); 

(d) GTrcAGCrmTTTGTACAAACTTGT(attRl)(SEQlDNO:9); 

(e) GTrCAGCnTCTrGTACAAACTTGT(attR2)(SEQIDNO:10); 
(0 GTTCAGCTTTCTTGTACAAAGTTGG (attR3) (SEQ ID 

N0:11); 
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(g) AGCXTGCnrnTGTACAAAGTTGG (attLl) (SEQ ID 
N0:12); 

(h) AGCCTGCTTTCTTGTACAAAGTTGG (attL2) (SEQ ID 
N0:13); 

(i) ACCCAGCTTTCTTGTACAAAGTTGG (attL3) (SEQ ID 
N0:14); 

0) <nTCAGCTrrrrrGTACAAAGTrGG(attPl)(SEQIDN0:l5); 

(k) GTTCAGCntCTTGTACAAAGTTGG (aaP2.P3) ^EQ ID 

and a conresponding oi conq>Iementaiy DNA or RNA sequence. 

16. A mefliodfi)! making a nucleic acid molecule, comprising 
providing a nucleic acid molecule having at least one engineered 

recombination site comprising at least one DNA sequence haying at least 90% 
homology to at least one of SEQ ID N0S:1-16. 

17. A nucleic acid molecule provided by a mediod according to 
claim 16. 

18. A composition, comprising a nucleic acid molecule accoiding to 



19. A kit, comprising a container being compartmentalized to receive 
in close confinement therein at least one conq)artment, wherein a first 
compartment contains a composition according to claim 18'. 



20. A kit according to claim 19, fiirther comprising a second 
compartment having at least one recombmation protdn that recognizes said 
recombination site. 
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21. A kit comprising a container bong compaitmentalized to re^ 
in close confinement therein at least one recombination proton in isolated form, 
useful for a method according to claim 22. 

22. A method of making a Cointegrate DNA molecule, comprising 
combining in vitro: 

(i) an Insert Donor DNA molecule, comprising a desired 
DNA segment flanked by a first recombination site and a 
second recombination site, wherein the first and second 
recombination ates do not tecombine with each other; 
Oi) a Vector Donor DNA molecule containing a third 
recombination site and a fourth recombination site, 
wherein die third and fourth recombination sites do not 
recombine vnth eadi other; and 
Gii) at least one site spedfic recombination protein capable of 
recombining said first and third recombinatianal sites said 
second and fourdi recombinational sites; 
thereby allowing recombination to occur, so as to produce a Cointegrate DNA 
molecule comprising said first and fliird or said second and four* recombination 



23. A method according to claim 22, wherein a Product DNA 
molecule is produced fiom said Coint^rate DNA by recombining at least one of 
(i) said first and third, or (li) said second and fourdi, recombination sites, said 
Product DNA coiqjrising said desired DNA segment 

24. A mediod according to claim 23, wherein said mediod also 
produces a Byproduct DNA molecule. 



25. A mediod according to claim 23, fiirther comprising selecting for 
die Product DNA molecule. 
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26. Amethodasclaimedinclaim22,wheremliieVectbrI^^ 
molecule comprises a vector segment flanked by said fliird and the fourth 

tecombinadon sites. 

27. A method as claimed in claim 22. wherem the Vector Donor DNA 
molecule further comprises (a) a toxic gene and (b) a Selectable marker, vrfierem 
the toxic gene and the Selectable maricer are on diff&ent DNA segments, the 
DNA segments bemg separated either by 0) m a circular DNA molecule, two 
recombination sites, or GO in a linear DNA molecule, one recombination site. 



28. Amediodasclaimedindaim22,w*ereinflieVectorDonorDNA 
molecde fiotfaer comprises (a) a rqnession cassette and (b) a Selectable maricer, 
tepcessed by die rqiressor of die rqnession cassette, and vAerein die Selectable 
marker and die repression cassette are on difiBaent DNA segments, the DNA 
segments being separated eidier by. (i) in a dicular DNA molecule, two 
recombination sites, or Cu) in a linear DNA molecule, one recombination site. 

29. Aniefliodasclaimedinclaim22.w*ereinatleastoneofdieInsert 
Donor DNA molecule and the Vector Donor DNA molecule is a circular DNA 
molecule. 

30. Amediodasclaimedinclaim22,M4iereinatleastoneoftheInsert 
Donor DNA molecule and the Vector Donor DNA molecule is a linear DNA 

. molecule. 

31. A method as claimed in claim 22. vAtcKux die sele<^ step is 
carried out in vitro ot in vivo. 

32. A mefliod as cldmed in claim 22, vrtierein said recombination 
protem comprises at least a first recombination protein and a second 
recombination protem, said second recombmation protein being different from 
said first recombination protein. 
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33. AineAodas<daimedmclaim22,wheirfnsaidrecombi^ 
protdnislnt 

34. A method as claimed in claim 22, v*erein the at least one 
recombination protein is selected fiom (i) lot and IHF and Cii) Int. Xis, and IHF. 
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